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(57) Abstract 



The present invention involves the use of fluorescence spectroscopy in the diagnosis of cervical cancer and precancer. Using multiple 
illumination wavelengths, it is possible to (Q differentiate normal or inflamed tissue from squamous intraepithelial lesions (SILs) and 00 
to differentiate high grade SILs from non-high grade SILs. The detection may be performed in vitro or in vivo. Multivariate statistical 
analysis was employed to reduce the number of fluorescence excitation-emission wavelength pairs needed to re-develop algorithms that 
demonstrate a minimum decrease in classification accuracy. Fluorescence at excitation-emission wavelength pain was used to redevelop 
and test screening and diagnostic algorithms that have a similar classification accuracy to those that employ fluorescence emission spectra at 
three excitation wavelengths. Both die full-parameter and reduced-parameter screening algorithms discriminate between SILs and non-SlLs 
with a similar specificity and a substantially improved sensitivity relative to Pap smear screening and differentiate high grade SILs from 
non-high grade SILs. 
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PESCRIPTIQN 

METHOD AND APPARATUS FOR DIAGNOSING SQUAMOUS INTRAEPITHELIAL LESIONS OF 
THE CERVIX USING FLUORESCENCE SPECTROSCOPY 

BACICKROIIND OF THE INVENTION 

I. Field of the Invention 

The invention relates to optical methods and apparatus used for the diagnosis of 
cervical precancers. 

IL R efrt e d Art 

There has been a significant decline in the incidence of advanced cervical cancer 
over the last 40 years, primarily due to the development of organized programs that 
target early detection of its curable precursor, cervical Squamous Intraepithelial Lesion 
(SIL) (SILs consist of Cervical Intraepithelial Neoplasia (CIN) and Human Papilloma 
Viral (HPV) infection) [1]. Even though organized screening (Pap smear) and diagnostic 
(colposcopy) programs are currently in place, approximately 15,900 new cases of 
cervical cancer and 4,900 cervical cancer related deaths were reported in 1995, in the 
United States alone [2], Currently, 24.5% of women with cervical cancer are under the 
age of 35 years, and the incidence continues to increase for women in this age group [1]. 
The continuing morbidity and mortality rate related to cervical cancer necessitates an 
improvement in the accuracy and efficacy of current detection modalities. 

The Pap smear is the primary screening tool for the detection of cervical cancer 
and its precursor [3], In a Pap test, a large number of cells obtained by scraping the 
cervical epithelium are smeared onto a slide which is then fixed and stained for cytologic 
examination. Each smear is then examined under a microscope for the presence of 
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neoplastic cells [4]. The Pap smear's reported sensitivity and specificity range from 11- 
99% and 14-97%, respectively. Like many screening tests in an asymptomatic 
population, the Pap smear is unable to achieve a concurrently high sensitivity and high 
specificity [5]. The accuracy of the Pap smear is limited by both sampling and reading 
errors [6]. Approximately 60% of false-negative smears are attributed to insufficient 
sampling; the remaining 40% are due to reading errors. Because of the monotony and 
fatigue associated with reading Pap smears (50,000-300,000 cells per slide), the 
American Society of Cytology has proposed that a cyto-technologist should be limited to 
evaluating no more than 12,000 smears annually [7]. As a result, accurate Pap smear 
screening is labor intensive and requires highly trained professionals. 

A patient with a Pap smear interpreted as indicating the presence of SIL is 
generally recommended for follow up with a diagnostic procedure called colposcopy [3]. 
During a colposcopic examination, the cervix is stained with acetic acid and viewed 
through a low power microscope to identify potential pre-cancerous sites; suspicious 
sites are biopsied and then histologically examined to confirm the presence, extent and 
severity of the SIL [8J. A patient who has high grade SIL (HG SIL) (which consists of 
CIN II and/or CIN III) is usually treated, whereas a patient diagnosed with low grade SIL 
(LG SIL) (which consists of HPV and/or CIN I) is generally followed further using 
colposcopy [3]. 

Colposcopic examination and tissue biopsy in expert hands maintains a high 
sensitivity (80-90%), at the expense of a significantly low specificity (50-60%) [9]. A 
poor specificity represents unnecessary biopsy of tissues which do not contain cervical 
pre-cancer. In spite of the poor specificity of this technique, extensive training is 
required to achieve this skill level. All biopsy specimens require histologic evaluation 
and, therefore, diagnosis is not immediate. The disconnection between colposcopic 
assessment and biopsy and definitive treatment is of particular concern in the 
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management of economically disadvantaged patients who may not return for treatment, 
particularly since cervical cancer precursors are more prevalent in groups of lower socio- 
economic status [1]. 

Fluorescence spectroscopy is a technique thai has the potential to improve the 
accuracy and efficacy of cervical pre-cancer screening and diagnosis. Fluorescence 
spectroscopy has the capability to quickly, non-invasively and quantitatively probe the 
biochemical and morphological changes that occur as tissue becomes neoplastic. The 
altered biochemical and morphological state of the neoplastic tissue is reflected in the 
spectral characteristics of the measured fluorescence. This spectral information can be 
correlated to tissue histopathology, the current "gold standard" to develop clinically 
effective screening and diagnostic algorithms. These mathematical algorithms can be 
implemented in software, thereby enabling automated, fast, non-invasive and accurate 
pre-cancer screening and diagnosis in the hands of non-experts. 

Although a complete understanding of the quantitative information contained 
within a tissue fluorescence spectrum has not been achieved, many groups have applied 
fluorescence spectroscopy for real-time, non-invasive, automated characterization of 
tissue pathology. Characterization of tissue patoology using auto-fluorescence [10-23] as 
well as photosensitizer induced fluorescence [24-27] to discriminate between diseased 
and non-diseased human tissues in vitro and in vivo has been described in a variety of 
tissues. 



Auto-fluorescence spectra of normal tissue, intraepithelial neoplasia and invasive 
carcinoma have been measured from several organ sites in vivo [13-17]. In vivo studies 
of the human colon at 370 ran excitation [13] indicated that a simple algorithm based on 
fluorescence intensity at two emission wavelengths can be used to differentiate normal 
colon and adenomatous polyps with a sensitivity and specificity of 100% and 97%, 
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respectively. Shomacker et al [14J conducted similar studies in vivo at 337 ran 
excitation and demonstrated that a multivariate linear regression algorithm based on laser 
induced fluorescence spectra can be used to discriminate between normal colon and 
colonic polyps with a similarly high sensitivity and specificity. Lam et al developed a 
5 bronchoscope which illuminates tissue at 442 nm excitation and produces a false color 
image in near real-time which represents the ratio of fluorescence intensities at 520 nm 
(green) and 690 nm (red) [16,17]. In vivo studies demonstrated that the ratio of red to 
green auto-fluorescence is greater in normal bronchial tissues than in abnormal bronchial 
tissues [16]. In a trial with 53 patients, the sensitivity of fluorescence bronchoscopy was 
1 0 found to be 72%, as compared to 50% for conventional white light bronchoscopy [ 1 7]. 

Nonetheless, a reliable diagnostic method with improved diagnostic capability for 
use in vitro and in vivo is needed to allow faster; more effective patient management and 
potentially further reduce mortality. 

15 

SUMMARY OF THE INVENTION 

The present invention demonstrates that fluorescence spectroscopy can be 
applied, both in vitro and in vivo, to the diagnosis of cervical tissue abnormalities 
20 including the clinical detection of cervical precancer. 

In a first exemplary embodiment, there is provided a method of detecting tissue 
abnormality in a tissue sample comprising the steps of (i) providing a tissue sample; (ii) 
illuminating said sample with electromagnetic radiation wavelengths of about 337 nm, 

25 about 380 nm and about 460 nm to produce three fluorescence intensity spectra; (iii) 
detecting a plurality of emission wavelengths from said fluorescence intensity spectra; 
and (iv) establishing from said emission wavelengths a probability that said sample is 
abnormal. The illumination wavelengths are advantageously in the ranges of 317-357 
nm, 360-400 nm and 440-480 nm. The method may further comprise preprocessing data 

30 at the emission wavelengths to reduce inter-sample and intra-sample variation. The 
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establishing step may comprise normalizing the spectra relative to a maximum intensity 
within the spectra. Optionally, the establishing step does not comprise mean-scaling the 
spectra. 

Emission wavelengths may be selected at about 410 nm, about 460 nm, about 510 
nm and about 580 nm for an illumination of about 337 nm; at about 460 nm, about 510 
nm, about 580 nm f about 600 nm and about 640 nm for an illumination of about 380 nm; 
and at about 510, about 580 nm, about 600 nm, about 620 nm, about 640 nm and about 
660 nm for an illumination of about 460 nm. 

In various embodiments, the method may use illumination in vitro or in vivo. 
Where the method is in vitro, the providing step may comprise obtaining the tissue 
sample by biopsy. Further, the in vitro method may comprise generating a monolayer 
cell touch preparation or a pellet and ethanol fixation of the tissue sample. The 
illumination may comprise illuminating the sample substantially normal to a surface of 
the sample, and wherein the detecting step comprises, detecting the spectra at an angle of 
approximately 20° from normal. 

The method is capable of distinguishing tissues as follows: normal squamous and 
abnormal tissue; normal columnar epithelium and abnormal tissue; inflamed and 
abnormal tissue; low grade SIL and high grade SIL tissue; and normal and high grade 
SIL tissue. 

The present invention also comprises a method of developing a model for 
differentiating normal from abnormal tissue in a tissue sample comprising the steps of (i) 
providing a plurality of tissue samples; (ii) illuminating the samples with electromagnetic 
radiation wavelengths of about 337 nm, about 380 nm and about 460 nm to produce three 
fluorescence intensity spectra; (Hi) detecting a plurality of emission wavelengths from 
the fluorescence intensity spectra; (iv) forming a set of principal components from the 
spectra, said principal components being defined as providing statistically significant 
differences between normal tissue and various forms of abnormal tissue; and (v) 
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subjecting said principal components to logistic discrimination to develop a relevant 
mathematical model. Again, the illumination wavelengths may be in the ranges of 317- 
357 nm, 360-400 nm and 440-480 nm. 

In still another embodiment, there is provided a method of detecting tissue 
abnormality in a tissue sample comprising the steps of (i) providing a tissue sample; (ii) 
illuminating the sample with electromagnetic radiation wavelengths of about 337 nm, 
about 380 nm and about 460 nm to produce three fluorescence intensity spectra; (iii) 
detecting a plurality of emission wavelengths from said fluorescence intensity spectra; 
and (iv) establishing from principal components PCI, PC3 and PC7 of step (m) a 
probability that the sample is abnormal. Alternatively, the principal components are 
PCI , PC2, PC4 and PCS or PCI, PC3 PC6 and PC8. 

In still yet another embodiment, there is provided a method of detecting tissue 
abnormality in a subject in vivo comprising the steps of (i) illuminating a tissue sample in 
vivo with a plurality of electromagnetic radiation wavelengths to produce a plurality of 
fluorescence intensity spectra; (ii) detecting a plurality of emission wavelengths from the 
fluorescence intensity spectra; and (iii) establishing from the emission wavelengths a 
probability that the sample is abnormal. The illumination wavelengths advantageously 
include at least three wavelengths having the ranges of about 317-357 nm, about 360- 
400 nm and about 440-480 nm. The method, may further comprise the step of 
preprocessing data at the emission wavelengths, to reduce inter-patient and intra-patient 
variation. 

In still yet another embodiment, there is provided a method for diagnosing 
cervical precancer in a patient comprising the steps of (i) illuminating cervical tissue of 
the patient with a plurality of electromagnetic radiation wavelengths to produce a 
plurality of fluorescence intensity spectra; (ii) detecting a plurality of emission 
wavelengths from said fluorescence intensity spectra; (iii) comparing the plurality of 
emission wavelengths from the patient with known emission wavelengths for normal 
tissue; and (iv) making a diagnostic prediction of the condition the cervical tissue. 
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In still yet another embodiment, the method of fluorescent spectroscopy is 
coupled with illuminating the sample with an electromagnetic radiation wavelength in 
the near infrared sufficient to produce a Raman spectrum and detecting a Raman 
5 spectrum therefrom. The near infrared illumination wavelength is about 789 nm. 
Advantageously, the emission frequencies are shifted about 626, 818, 978, 1070, 1 175, 
1246, 1330, 1454 and 1656 cm" 1 from the illumination wavelength. In a particular 
embodiment, at least one of the emission frequencies is associated with compound 
selected from the group consisting of collagen, phospholipids and glucose- 1 -phosphate. 

10 

The present invention also contemplates an apparatus for detecting tissue 
abnormality, comprising, a light source for emitting a plurality of electromagnetic 
radiation wavelengths, and a probe connected to the light source, the probe being adapted 
to apply the plurality of radiation wavelengths to tissue under test and to gather 
15 fluorescence emitted from the tissue under test. The apparatus also, includes means, 
connected to probe, for detecting at least one fluorescence spectrum emitted from the 
tissue under test, and a programmed computer connected to the detection means, for 
processing the at least one fluorescence spectrum according to a predetermined algorithm 
to establish a probability that the tissue under test is abnormal. 

20 

The light source may comprise a nitrogen pumped laser, and the plurality of 
electromagnetic radiation wavelengths may be about 337 nm, about 380 nm and about 
460 nm. 

25 The apparatus may further comprise a polychromator connected between the 

probe and the detection means. 

The probe may include emission optical fibers for applying the plurality of 
electromagnetic wavelengths to the tissue under test, collection optical fibers for 
30 gathering the fluorescence emitted from the tissue under test, and a shield overlying ends 
of the excitation optical fibers and collection optical fibers. 
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The predetermined algorithm may include principal components that predict 
statistically relevant differences between fluorescence emission wavelengths from 
normal and abnormal tissues for the plurality of applied electromagnetic radiation 
wavelengths. 

Finally, the abnormal tissues that are predicted may include inflamed tissue, low 
grade SIL and high grade SIL. 

Other objects, features and advantages of the present invention will become 
apparent from the following detailed description. It should be understood, however, that 
the detailed description and the specific examples, while indicating specific embodiments 
of the invention, are given by way of illustration only, since various changes and 
modifications within the spirit and scope of the invention will become apparent to those 
skilled in the arc from this detailed description. 

BRIEF DESCRIPTION OF T HF PR AWINf^ 

The following drawings form part of the present specification and are included to 
further demonstrate certain aspects of the present invention. The invention may be better 
understood by reference to one or more of these drawings in combination with the 
detailed description of the specific embodiments presented herein. 

FIG. I. A schematic of the portable fluorimeter used to measure cervical tissue 
fluorescence spectra at three excitation wavelengths. 

FIG. 2. A flow chart of a formal analytical process used to develop the screening 
and diagnostic algorithms. The text in the dashed-Iine boxes represent 
mathematical steps implemented on the spectral data and the text in the solid line 
boxes represent outputs after each mathematical step (NS - normal squamous, NC 
- normal columnar, LG - LG SIL and HG - HG SIL). 
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FIGS. 3A, 3B and 3C. (FIG. 3A) Original and corresponding, (FIG. 3B) 
normalized and (FIG. 3C) normalized, mean-scaled spectra at 337 run excitation 
from a typical patient. 

FIGS. 4A, 4B and 4C. (FIG. 4A) Original and corresponding, (FIG. 4B) 
normalized and (FIG. 4C) normalized; mean-scaled spectra at 380 nm excitation 
from the same patient 

FIGS- 5A f 5B and 5C. (FIG. 5A) Original and corresponding, (FIG. 5B) 
normalized and (FIG. 5C) normalized, mean-scaled spectra at 460 nm excitation 
from the same patient. 

FIG, 6. A plot of the posterior probability of belonging to the SIL category of all 
SILs and normal squamous epithelia from the calibration set. Evaluation of the 
misclassified SILs indicates that one samples with CIN III, two with CIN II, two 
with CIN I and two with HPV are incorrectly classified. 

FIG. 1. A plot of the posterior probability of belonging to the SIL category of all 
SILs and normal columnar epithelia from the calibration data set. Evaluation of 
the misclassified SILs indicates that three samples with CIN II, three with CIN I 
and one with HPV are incorrectly classified. 

FIG. 8. A plot of the posterior probability of belonging to the HG SIL category 
of all SILs from the calibration set. Evaluation of the misclassified HG SILs 
indicates that three samples with CIN III and three with CIN are incorrectly 
classified as LG SILs; five samples with CIN I and two with HPV are 
misclassified as HG SIL. 



WO 97/48331 



PCT/US97/10729 



-10- 

FIGS. 9A, 9B and 9C. Component loadings (CL) of diagnostic principal 
components of constituent algorithm (1), obtained from normalized spectra at 
(FIG. 9A) 337, (FIG. 9B) 380 and (FIG. 9C) 460 nm excitation, respectively. 

FIGS. 10A, 10B and 10C. Component loadings (CL) of diagnostic principal 
components of constituent algorithm (2), obtained from normalized, mean-scaled 
spectra at (FIG. 10A) 337, (FIG. 10B) 380 and (FIG. 10C) 460 nm excitation, 
respectively. 

FIGS. 11A, 11B and 11C. Component loadings (CL) of diagnostic principal 
components of constituent algorithm (3), obtained from normalized spectra at (A) 
337, (B) 380 and (C) 460 nm excitation, respectively. 

DETAILED DESCRIPTION OF THF : INVFNT^ 

I. The Present Invention 

This present invention provides for the development and application of a 
detection technique for human cervical pre-cancer, both in vitro and in v/vo, based on 
laser induced fluorescence spectroscopy. Fluorescence spectra from 381 cervical 
samples in 95 patients were acquired at three excitation wavelengths: 337, 380 and 460 
run. A general multivariate statistical algorithm is then used to analyze and extract 
clinically useful information from tissue spectra acquired in vivo. The present invention 
includes a screening algorithm to discriminate between SILs and-non SILs (normal 
squamous and columnar epithelia and inflammation); and a diagnostic algorithm to 
differentiate high grade SILs from non-high grade SILs (low grade SILs, normal 
epithelia and inflammation). The retrospective and prospective accuracy of both the 
screening and diagnostic algorithms is compared to the accuracy of Pap smear screening 
[5] and to colposcopy in expert hands [9]. 
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The general multivariate statistical algorithm was developed and tested using 
cervical tissue spectra acquired at 337 nm excitation from 476 cervical sites in 92 
patients. This algorithm could be used to differentiate SILs and normal squamous tissues 
with an average sensitivity and specificity of 91%±2 and 78%±3, respectively. Spectra 
of normal columnar tissues and inflammation were indistinguishable from those of SILs 
at this single excitation wavelength. Furthermore, a multivariate statistical algorithm 
based solely on spectra at 337 nm excitation only could not discriminate between high 
grade SILs and low grade SILs effectively. 

However, multivariate statistical analysis of cervical tissue fluorescence spectra 
acquired in vivo at 380 nm and 460 nm excitation from a subset of the 92 patients 
indicated that spectra at these excitation wavelengths can overcome the limitations of 
spectra at 337 nm excitation. Spectra at 380 nm excitation from 165 sites in a first group 
of 40 patients could be used to differentiate SILs from normal columnar epithelia and 
inflammation with a sensitivity and specificity of 77%± 1 and 72%±9, respectively; 
spectra at 460 nm excitation from 149 sites in a second group of 24 patients could be 
used to differentiate high grade SILs from low grade SILs with a sensitivity and 
specificity of 80%±4 and 76%±5, respectively. 

There are two principal limitations of previous studies using fluorescence 
spectroscopy in the diagnosis of precancers. A first limitation is that fluorescence spectra 
were not acquired at all three excitation wavelengths (337, 380 and 460 nm) from every 
patient in the study. Therefore, analysis of spectral data from these studies did not 
indicate if the classification accuracy of each of the three constituent algorithms 
developed using spectra at a single excitation wavelength could be improved by utilizing 
tissue spectra at all three excitation wavelengths. A second limitation of these studies is 
that the accuracy of composite screening and diagnostic algorithms utilizing a 
combination of the constituent algorithms could not be evaluated since tissue spectra 
were not available at all three excitation wavelengths from the same group of patients. 
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In accordance with one embodiment, an algorithm based on normalized, mean- 
scaled spectra at 337 nm excitation can be used to differentiate between SILs and normal 
squamous tissues, while a second algorithm based on similarly pre-processed spectra at 
380 nm excitation can be used to differentiate SILs from normal columnar tissues and 
S samples with inflammation. A third algorithm, based on normalized tissue spectra at 460 
nm excitation, can be used to discriminate between low grade SILs and high grade SILs. 

Thus, a first goal of this analysis is to evaluate the accuracy of constituent and 
composite algorithms which address these questions. Fluorescence spectra acquired in 
10 vivo at all three excitation wavelengths from 381 cervical sites in 95 patients were 
analyzed to determine if the accuracy of each of the three constituent algorithms 
previously developed can be improved using tissue spectra at a combination of two or 
three excitation wavelengths rather than at a single excitation wavelength. 

15 A second goal of this analysis is to integrate the three independently developed 

constituent algorithms which discriminate between pairs of tissue types into composite 
screening and diagnostic algorithms that can achieve discrimination between many of the 
clinically relevant tissue types. The effective accuracy of a composite screening 
algorithm for the identification of SILs (normal epithelium and inflammation versus 

20 SIL) and a composite diagnostic algorithm for the identification of high grade SILs (non- 
high grade versus high grade) was evaluated. 

The. third goal of the analysis is to determine if fluorescence intensities at a 
reduced number of excitation-emission wavelength pairs can be used to re-develop 

25 constituent and composite algorithms that can achieve classification with a minimum 
decrease in predictive ability. A significant reduction in the number of required 
fluorescence excitation-emission wavelength pairs could enhance the development of a 
cost-effective clinical fluorimeter. The accuracy of the constituent and composite 
algorithms based on the reduced emission variables was compared to the accuracy of 

30 those that utilize entire fluorescence emission spectra. 
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This ultimate result of this analysis is the clinically applicable diagnostic method 
for diagnosing cervical precancer in accordance with the present invention. A patient 
may present in the clinic with an indication of disease or for a routine check up. The 
decision will be made to conduct an in vitro or in vivo diagnosis. If in vitro is selected, a 
tissue sample is taken from the cervix and prepared for examination. If in vivo is 
selected, the patient is examined directly. The general steps that follow are much the 
same. First, the cervical tissue is illuminated with a plurality of electromagnetic 
radiation wavelengths to produce a plurality of fluorescence intensity spectra. Second, a 
plurality of emission wavelengths from the fluorescence intensity spectra is detected 
Third, the plurality of emission wavelengths from the patient is compared with known 
emission wavelengths for normal tissue. And fourth, a diagnostic determination on the 
cervical tissue is made. 

H. Multivariate Statistical Method Development 

The five primary steps involved in multivariate statistical methods of the present 
invention are (i) preprocessing of spectral data from each patient to account for inter- 
patient variation, (ii) partitioning of the preprocessed spectral data from all patients into 
calibration and prediction sets, (iii) dimension reduction of the preprocessed spectra in 
the calibration set using principal component analysis, (iv) selection of the diagnostically 
most useful principal components using a two-sided unpaired t-test and (v) development 
of an optimal classification scheme based on logistic discrimination using the 
diagnostically useful principal component scores of the calibration set as inputs. These 
five individual steps of the multivariate statistical method are discussed below in more 
detail. 

(i) Preprocessing: The objective of preprocessing is to calibrate tissue spectra for 
inter-patient variation which might obscure differences in the spectra of different 
tissue types. Four methods of preprocessing were invoked on the spectral data: 
(a) normalization, (b) mean scaling, (c) a combination of normalization and mean 
scaling and (d) median scaling. 
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Spectra were normalized by dividing the fluorescence intensity at each emission 
wavelength by the maximum fluorescence intensity of that sample. Normalizing 
a fluorescence spectrum removes absolute intensity information; methods 
developed from normalized fluorescence spectra rely on differences in spectral 
S line shape information for diagnosis. If the contribution of the absolute intensity 

information is not significant, two advantages are realized by utilizing normalized 
spectra. First, it is no longer necessary to calibrate for inter-patient variation of 
normal tissue fluorescence intensity as in the two-stage method. And second, 
identification of a colposcopically normal reference site in each patient prior to 
1 0 spectroscopic analysis is no longer needed. 

Mean scaling was performed by calculating the mean spectrum for a patient 
(using all spectra obtained from cervical sites in that patient) and subtracting it 
from each spectrum in that patient. Mean-scaling can be performed on both 
unnormalized (original) and normalized spectra. Mean-scaling does not require 
colposcopy to identify a reference normal site in each patient prior to 
spectroscopic analysis. However, unlike normalization, mean-scaling displays 
the differences in the fluorescence spectrum from a particular site with respect to 
the average spectrum from that patient Therefore this method can enhance 
differences in fluorescence spectra between tissue categories most effectively 
when spectra are acquired from approximately equal numbers of non diseased 
and diseased sites from each patient. 

Median scaling is performed by calculating the median spectrum for a patient 
(using all spectra obtained from cervical sites in that patient) and subtracting it 
from each spectrum in that patient. Like mean scaling, median scaling can be 
performed on both unnormalized (original) and normalized spectra, and median 
scaling does not require colposcopy to identify a reference normal site in each 
patient prior to spectroscopic analysis. However, unlike mean scaling, median 
scaling does not require the acquisition of spectra from equal numbers of non 
diseased and diseased sites from each patient. 
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(ii) Calibration and Prediction Data Sets: The preprocessed spectral data were 
randomly assigned into either a calibration or prediction set. The multivariate 
statistical method was developed and optimized using the calibration set. It was 

5 then tested prospectively on the prediction data set 

(iii) Principal Component Analysis: Principal component analysis (PCA) is a 
linear model which transforms the original variables of a fluorescence emission 
spectrum into a smaller set of linear combinations of the original variables called 

10 principal components that account for most of the variance of the original data 

set. Principal component analysis is described in Dillon W.R., Goldstein M, 
Multivariate Analysis: Methods and Applications, John Wiley and Sons, 1984, 
pp. 23-52, the disclosure of which is expressly incorporated herein by reference. 
While PCA may not provide direct insight to the morphologic and biochemical 

1 ? ba sis of tissue spectra, it provides a novel approach of condensing all the spectral 

information into a few manageable components, with minimal information loss. 
Furthermore, each principal component can be easily related to the original 
emission spectrum, thus providing insight into diagnostically useful emission 
variables. 
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Prior to PCA, a data matrix is created where each row of the matrix contains the 
preprocessed fluorescence spectrum of a sample and each column contains the 
pre-processed fluorescence intensity at each emission wavelength. The data 
matrix D (RC), consisting of R rows (corresponding to r total samples from all 
patients in the training set) and C columns (corresponding to intensity at c 
emission wavelengths) can be written as: 



D = 



(D,A,..D lc ^ 
D„D a ...D ae 

DnD, 2 ...Dj 



(1) 



WO 97/48331 



PCT/US97/10729 



16 



The first step in PC A is to calculate the co variance matrix, Z. First, each column 
of the preprocessed data matrix D is mean-seated. The mean-scaled preprocessed 
data matrix, D m is then multiplied by its transpose and each element of the 
resulting square matrix is divided by (r-1), where r is the total number of samples. 
The equation for calculating Z is defined as: 



Z = ~(Dn« 'Dm) 

r- 1 



(2) 



The square covariance matrix, Z (c x c) is decomposed into its respective 
eigenvalues and eigenvectors. Because of experimental error, the total number of 
eigenvalues will always equal the total number of columns (c) in the data matrix 
D assuming that c < r. The goal is to select n < c eigenvalues that can describe 
most of the variance of the original data matrix to within experimental error. The 
variance, V accounted for by the first n eigenvalues can be calculated as follows: 



V = 100 



( % \ 



\ H J 



(3) 



The criterion used in this analysis was to retain the first n eigenvalues and 
corresponding eigenvectors that account for 99 % of the variance in the original 
data set. 



Next, the principal component score matrix can be calculated according to the 
following equation: 



R = DC (4) 
where, D (RC) is the preprocessed data matrix and C (c x n) is a matrix whose 
columns contain the n eigenvectors which correspond to the first n eigenvalues. 
Each row of the score matrix R (r x c) corresponds to the principal component 
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scores of a sample and each column corresponds to a principal component. The 
principal components are mutually orthogonal to each other. 

Finally, the component loading is calculated for each principal component. The 
component loading represents the correlation between the principal component 
and the variables of the original fluorescence emission spectrum. The component 
loading can be calculated as shown below: 

_ Cy 

(5) 



where, CLy represents the correlation between the ith variable (preprocessed 
intensity at ith emission wavelength) and the jth principal component. Cg is the 
ith component of the jth eigenvector, X } is the jth eigenvalue and S H is the 
variance of the ith variable. 

Principal component analysis was performed on each type of preprocessed data 
matrix, described above. Eigenvalues accounting for 99% of the variance in the 
original preprocessed data set were retained The corresponding eigenvectors were 
then multiplied by the original data matrix to obtain the principal component 
score matrix R. 

(iv) Student's T-Test: Average values of principal component scores were 
calculated for each histopathologic tissue category for each principal component 
obtained from the preprocessed data matrix. A two-sided unpaired students t-test 
was employed to determine the diagnostic contribution of each principal 
component. Such a test is disclosed in Devore J.L., Probability and Statistics for 
Engineering and the Sciences, Brooks/Cole, 1992, and in Walpole R.E., Myers 
R.H., Probability and Statistics for Engineers and Scientists, Macmillan 
Publishing Co., 1978, Chapter 7, the disclosures of which are expressly 
incorporated herein by reference. The hypothesis that the means of the principal 
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component scores of two tissue categories are different were tested for 1) normal 
squamous epithelia and SILs, 2) columnar normal epithelia and SILs and 3) 
inflammation and SILs. The t-test was extended a step further to determine if 
there are any statistically significant differences between the means of the 
principal component scores of high grade SILs and low grade SILs. Principal 
components for which the hypothesis stated above were true below the 0.05 level 
of significance were retained for further analysis. 

(v) Logistic Discrimination: Logistic discriminant analysis is a statistical 
technique that can be used to develop diagnostic methods based on posterior 
probabilities, overcoming the drawback of the binary decision scheme employed 
in the two-stage method. This statistical classification method is based on Bayes 
theorem and can be used to calculate the posterior probability that an unknown 
sample belongs to each of the possible tissue categories identified. Logistic 
1 5 discrimination is discussed in Albert A.. Harris E.K., Multivariate Interpretation 

of Clinical Laboratory Data, Marcel Dekker, 1987, the disclosure of which is 
expressly incorporated herein by reference. Classifying the unknown sample into 
the tissue category for which its posterior probability is highest results in a 
classification scheme that minimizes the rate of misclassification. 
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For two diagnostic categories, G, and G 2 , the posterior probability of being a 
member of G u given measurement x. according to Bayes theorem is: 



P(Gl |X) = P(x|G,)P(G,)C(2|l) 

P(x|G,)P(G,)C(2|l) + P(x|G2)P(G J )C(l|2) 



(6) 



where P(x«G,) is the conditional joint probability that a tissue sample of type i 
will have principal component score x, and P(Gj) is the prior probability of 
finding tissue type i in the sample population. CO Hi) is the cost of 
misclassifying a sample into group j when the actual membership is group i. 
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The prior probability P(Gi) is an estimate of the likelihood that a sample of type i 
belongs to a particular group when no information about it is available. If the 
sample is considered representative of the population, the observed proportions of 
cases in each group can serve as estimates of the prior probabilities. In a clinical 
setting, either historical incidence figures appropriate for the patient population 
can be used to generate prior probabilities, or the practitioner's colposcopic 
assessment of the likelihood of precancer can be used to estimate prior 
probabilities. 

The conditional probabilities can be developed from the probability distributions 
of the n principal component scores for each tissue type, i. The probability 
distributions can be modeled using the gamma function, which is characterized 
by two parameters, alpha and beta, which are related to the mean and standard 
deviation of the data set. The Gamma function is typically used to model skewed 
distributions and is defined below: . 

/(x;a,p)=~— i-V'" (7) 

P r\a) 

The gamma function can be used to calculate the conditional probability that a 
sample from tissue type i, will exhibit the principal component score, x. If more 
than one principal component is needed to describe a sample population, then the 
conditional joint probability is simply the product of the conditional probabilities 
of each principal component (assuming that each principal component is an 
independent variable) for that sample population. 

III. Instrumentation 

Fluorescence occurs when a fraction of the light absorbed by the tissue is 
reradiated at emission wavelengths that are longer than the excitation light Thus, the 
apparatus of the present invention includes a controllable illumination device for emitting 
a plurality of electromagnetic radiation wavelengths selected to cause a tissue to produce 
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a fluorescence intensity spectrum. Typically, the light source is a laser. Also included 
are an optical system for applying the plurality of radiation wavelengths to a tissue 
sample, a fluorescence intensity spectrum detecting device for detecting an intensity of 
fluorescence spectra emitted by the sample as a result of illumination by the plurality of 
electromagnetic radiation wavelengths. Optionally, the system may include a data 
processor, connected to the detecting device, for analyzing detected fluorescence spectra 
to calculate a probability that the sample is abnormal. 

A schematic of the portable fluorimeter which was used to acquire cervical tissue 
fluorescence spectra at three excitation wavelengths is shown in FIG. 1(1). The fiber- 
optic probe includes a central fiber surrounded by a circular array of six fibers; all seven 
fibers have the same characteristics (0.22 NA, 200 jim core diameter). Three fibers 
along the diameter of the distal end of the probe (FIG. 1 (3)) are used for excitation light 
delivery (overlap of the illumination area viewed by the three excitation fibers is greater 
than 85%). The purpose of the remaining four fibers is to collect the emitted 
fluorescence from the area (1 mm diameter) directly illuminated by the probe. A quartz 
shield at the tip of the distal end of the probe which is in direct tissue contact (FIG. 1 (5)) 
provides a fixed distance between the optical fibers and the tissue surface so fluorescence 
intensity can be measured in calibrated units. 

Two nitrogen pumped-dye lasers are used tb provide illumination at three 
different excitation wavelengths: one laser serves to deliver excitation light at 337 nm 
(fundamental) and has a dye module which is used to generate light at 380 nm using the 
fluorescent dye, BBQ (1E-03 M in 7 parts toluene and 3 parts ethanol). The dye module 
of the second laser is used to provide illumination at 460 nm, using the fluorescent dye, 
Coumarin 460 (1E-02 M in ethanol). Laser illumination at each excitation wavelength, 
337, 380 and 460 nm is coupled into each of the excitation fibers. In this study, the 
average transmitted pulse energies at 337, 380 and 460 nm excitation were 12, 9 and 14 
MJ» respectively. The laser characteristics were a 5 ns pulse duration and a repetition rate 
of 30 Hz. 
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The proximal ends of the four emission collection fibers are arranged in a circular 
array and imaged at the entrance slit of a polychromator coupled to a 1,024 intensified 
diode array controlled by a multi-channel analyzer. 360, 400 and 470 nm long pass 
filters are used to block scattered excitation light at 337, 380 and 460 nm excitation, 
respectively from the detector. A 205 ns collection gate, synchronized to the leading edge 
of the laser pulse using a pulser (Princeton Instruments, PG200), eliminates the effects of 
the colposcope's white light illumination during fluorescence measurements. Data 
acquisition is computer controlled. 

IV- Comb i ll i nP Fluorescence and Raman S p ectmspnpi^ 

The present invention also contemplates a method that combines fluorescence 
spectroscopy, as described above, with NIR Raman spectra, in vitro or in vivo. This is 
accomplished through a light source, sequentially or simultaneously, in conjunction with 
an optical coupling system for the application and analysis of both kinds of data. For 
example, a probe is selectively coupled , to ultraviolet or visible sources of 
electromagnetic radiation to excite fluorescence, and then selectively coupled to NIR 
sources to excite fluorescence-free Raman spectra. The fluorescence spectra may be 
used to improve the analytical rejection of fluorescence from the Raman spectrum. 

In one embodiment, the apparatus comprises a diachronic mirror or swing-way 
mirror so that each electromagnetic radiation: source is selectively coupled to the optical 
excitation fiber(s). Similarly, light collected by the collection fiber may be selectively 
coupled to the appropriate detectors. Alternatively, a probe may house discrete sets of 
fluorescence and Raman excitation and detection fibers, thereby obviating the need for 
mirrors. 

In analyzing the spectra, fluorescence may be used advantageously to identify 
normal tissues and low and high grade lesions. NIR Raman spectra can be used 
advantageously to identify inflammation and metaplasia. Alternatively, information 
gathered about the tissue type, in accordance with the above-described fluorescence 
methods, can be used to improve the Raman diagnostic capability. This is accomplished 
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by using fluorescence spectra to calculate the posterior probability that tissue is normal, 
low grade or high grade SIL. Then, this classification is used as the prior probability in 
logistic discrimination, based on the detected Raman spectra. In yet another 
embodiment, information gathered with NIR Raman spectroscopy is used to calculate the 
5 posterior probability that the tissue is inflamed or metaplastic. Then, this information is 
used as the prior probability in logistic discrimination, based on the detected fluorescence 
spectrum. By the same token, Raman can improve the diagnostic performance of 
fluorescence by reducing misclassiflcation of inflammation and metaplasia as precancer. 

10 In one embodiment, the Raman method comprises illuminating a sample with an 

electromagnetic radiation wavelength in the near infrared to produce a Raman spectrum 
shifted from the illumination wavelength; detecting a plurality of emission frequencies of 
the spectrum; and establishing from the emission frequencies a probability that the 
sample is abnormal. The excitation wavelength typically is between 700 and 850 am, 

1 5 with a wavelength of about 789 nm being a specific embodiment. Emission frequencies 
are shifted about 626. 818, 978, 1070, 1 175, 1246, 1330, 1454 and 1656 cm" 1 from 789 
nm in a more specific embodiment. 

Another approach that may be taken in examining Raman spectra is to look at the 
20 Raman signatures of specific compounds. In this embodiment, compounds of interest 
include collagen, phopholipids and glucose- 1 -phosphate. The Raman spectrum 
measured for collagen contains peaks at 770, 826, 1066, 1181, 1248, 1452 and 1660 cm" 
\ Glucose 1 -phosphate is an intermediate product in the glycogen-glucose conversion 
cycle and found in abundance in cervical epithelial cells. The spectrum of glucose 1- 
25 phosphate includes peaks at 850, 970, 1053, 1091, 1 146 and 1350 cm" 1 . The peaks for 
one major phospholipid, phosphatidylcholine, are at 740, 980, 1091, 1320 and 1454 cm" 1 . 

V. Examples 

The following examples are included to demonstrate specific embodiments of the 
30 present invention. It should be appreciated by those of skill in the art that the techniques 
disclosed in the examples which follow represent techniques discovered by the inventors 
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to function well in the practice of the invention, and thus can be considered to constitute 
specific modes for its practice. However, those of skill in the art should, in light of the 
present disclosure, appreciate that many changes can be made in the specific 
embodiments which are disclosed and still obtain a like or similar result without 
departing from the spirit and scope of the invention. 

EXAMPLE It MFTHOnS 

Clinical measurements. A randomly selected group of non-pregnant patients 
referred to the colposcopy clinic of the University of Texas MD Anderson Cancer Center 
on the basis of abnormal cervical cytology was asked to participate in the in vivo 
fluorescence spectroscopy study. Informed consent was obtained from each patient who 
participated and the study was reviewed and approved by the Institutional Review 
Boards of the University of Texas, Austin and. the University of Texas, MD Anderson 
Cancer Center. Each patient underwent a complete history and a physical examination 
including a pelvic exam, a Pap smear and colposcopy of the cervix, vagina and vulva. 

After colposcop'ic examination of the cervix, but before tissue biopsy, 
fluorescence spectra were acquired on average from two colposcopically abnormal sites, 
two colposcopically normal squamous sites and 1 normal columnar site (if 
colposcopically visible) from each patient. Tissue biopsies were obtained only from 
abnormal sites identified by colposcopy and subsequently analyzed by the probe to 
comply with routine patient care procedure. All tissue biopsies were fixed in formalin 
and submitted for histologic examination. Hemotoxylin and eosin stained sections of 
each biopsy specimen were evaluated by a panel of four board certified pathologists and 
a consensus diagnosis was established using the Bethesda classification system [1]. This 
classification system which has previously been used to grade cytologic specimens has 
now been extended to classification of histology samples. Samples were classified as 
normal squamous, normal columnar, inflammation, low grade SIL or high grade SIL. 
Samples with multiple diagnoses were classified into the most severe histo-pathologic 
category. 
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Prior to each patient study, the probe was disinfected and a background spectrum 
was acquired at all three excitation wavelengths consecutively with the probe dipped in a 
non-fluorescent bottle containing distilled water. The background spectrum was 
subtracted from all subsequently acquired spectra at corresponding excitation 
wavelengths for that patient. Next, with the probe placed on the face of a quartz cuvette 
containing a solution of Rhodamine 610 dissolved in ethylene glycol (2 mg/L), 50 
fluorescence spectra were measured at each excitation wavelength. After calibration, 
fluorescence spectra were acquired from the cervix: 10 spectra for 10 consecutive pulses 
were acquired at 337 nm excitation; next, 50 spectra for 50 consecutive laser pulses were 
measured at 380 nm excitation and then at 460 nm excitation. The data acquisition time 
was 0.33 s at 337 nm excitation and 1.67 s at each 380 and 460 nm excitation per 
cervical site. Spectra were collected in the visible region of the electromagnetic 
spectrum with a resolution of 10 nm (full width at half maximum) and a signal to noise 
ratio of 30:1 at the fluorescence maximum at each excitation wavelength. 

All spectra were corrected for the non-uniform spectral response of the detection 
system using correction factors obtained by recording the spectrum of an N.I.S.T 
traceable calibrated tungsten ribbon filament lamp. Spectra from each cervical site at 
each excitation wavelength were averaged and normalized to the peak fluorescence 
intensity of the Rhodamine 610 calibration standard at the corresponding excitation 
wavelength for that patient; absolute fluorescence intensities are reported in these 
calibrated units. In this clinical study, fluorescence spectra were acquired at all three 
excitation wavelengths from each cervical site from a total of 381 sites in 95 patients 
during colposcopy. 

EXAMPLE 2; STATISTICAL MF.THOns 

Development of screening and diagnostic algorithms: FIG. 2 illustrates a 
schematic of the formal analytical process used to develop screening and diagnostic 
algorithms for the differential detection of SILs, in vivo. In FIG. 2, the text in the 
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dashed-line boxes represents the mathematical steps implemented on the spectral data, 
and the text in the solid-line boxes represent the output after each mathematical process. 
There are four primary steps involved in the multivariate statistical analysis of tissue 
spectral data (FIG. 2). The first step is to pre-process spectral data to reduce inter-patient 
5 and intra-patient variation within a tissue type; the pre-processed spectra arc then 
dimensionally reduced into an informative set of principal components which describe 
most of the variance of the original spectral data set using Principal Component Analysis 
(PC A). Next, the principal components which contain diagnostically relevant 
information are selected using an unpaired, one-sided student's t-test, and finally a 
10 classification algorithm based on logistic discrimination is developed using these 
diagnostically relevant principal components. 

In summary, three constituent algorithms were developed using multivariate 
statistical analysis (FIG. 2): a constituent algorithm (1) discriminates between SILs and 

15 normal squamous tissues, a constituent algorithm (2) discriminates between SILs and 
normal columnar tissues and finally, a constituent algorithm (3) differentiates high grade 
SILs from low grade SILs. The three constituent algorithms were then combined to 
develop two composite algorithms (FIG. 2): constituent algorithms (!) and (2) were 
combined to develop a composite screening algorithm which discriminates between SILs 

20 and non SILs. All three constituent algorithms were then combined to develop a 
composite diagnostic algorithm which differentiates high grade SILs from non-high 
grade SILs. 

Multivariate statistical analysis of cervical tissue spectra. As a first step, three 
25 methods of pre-processing were applied to the spectral data at each excitation 
wavelength: 1) normalization 2) mean-scaling and 3) a combination of normalization and 
mean-scaling. Similarly pre-processed spectra at each excitation wavelength were 
combined to create spectral inputs at the following combinations of excitation 
wavelengths: (337, 460) nm, (337, 380) nm, (380, 460) nm and (337, 380, 460) nm. Pre- 
30 processing of spectral data resulted in four types of spectral inputs (original and three 
types of pre-processed spectral inputs) at four single excitation wavelengths and at four 
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possiblc combinations of multiple excitation wavelengths. Hence, there were a total of 
12 spectral inputs at single excitation wavelengths and 16 spectral inputs at multiple 
excitation wavelengths which were evaluated using the multivariate statistical algorithm. 

Prior to PCA, the input data matrix, D (RC) was created so each row of the 
matrix corresponded to the pre-processed fluorescence spectrum of a sample and each 
column corresponded to the pre-processed fluorescence intensity at each emission 
wavelength. Spectral inputs at multiple excitation wavelengths were created by 
arranging spectra at each excitation wavelength in series in the original spectral data 
matrix. PCA [28] was used to dimensionally reduce the pre-processed spectral data 
matrix into a smaller orthogonal set of linear combinations of the emission variables that 
account for most of the variance of the spectral data set. 

Average values of principal component scores were calculated for each principal 
component of each tissue type. An unpaired, one-sided student's t-test [29] was 
employed to determine the diagnostic content of each principal component. The 
hypothesis that the means of the principal component scores of two tissue types are 
different was tested for (1) normal squamous epithelia and SILs, (2) normal columnar 
epithelia and SILs and (3) inflammation and SILs. The t-test was extended a step further 
to determine if there were any statistically significant differences between the means of 
the principal component scores of high grade SILs and low grade SILs. Principal 
components for which the hypothesis stated above was statistically significant (/> < 0.05) 
were retained for further analysis. 

Next, a statistical classification algorithm was developed using the diagnostically 
useful principal components to calculate the posterior probability that an unknown 
sample belongs to each tissue type under consideration. The posterior probability of an 
unknown sample belonging to each tissue type was calculated using logistic 
discrimination [30]. The posterior probability is related to the prior and conditional joint 
probabilities and to the costs of misclassification of the tissue types under consideration. 
The prior probability of each tissue type was determined by calculating the observed 
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proportion of cases in each group. The cost of misclassification of a particular tissue 
type was varied from 0 to 1 in 0.1 increments, and the optimal cost was identified when 
the total number of misclassified samples based on the classification algorithm was a 
minimum. If there was more than one cost at which the total number of misclassified 
samples was a minimum, the cost that maximized sensitivity was selected. The 
conditional joint probabilities were developed by modeling the probability distribution of 
each principal component of each tissue type using the normal probability density 
function [31], which is characterized by p (mean) and a (standard deviation). The best 
fit of the normal probability density function to the probability distribution of each 
principal component (score) of each tissue type was obtained in the least squares sense, 
using \x and o as free parameters of the fit. The normal probability density function was 
then used to calculate the conditional joint probability that an unknown sample, given 
that it is from tissue type i, will exhibit a set of principal component scores, X. 

The multivariate statistical algorithm was developed and optimized using a 
calibration set and then tested in an unbiased manner on a prediction set of 
approximately equal prior probability (Table 1). Data in the prediction set is pre- 
processed and organized into two prediction datasets in the following way: 

Spectra obtained from each patient at each excitation wavelength are separately 
(1) normalized and (2) normalized, followed by mean-scaling. Spectra at each excitation 
wavelength, processed in a similar manner arc; concatenated into a vector. Two 
prediction data matrices are developed. In each matrix, each row is a vector containing 
similarly pre-processed fluorescence emission spectra at 337, 380 and 460 nm excitation 
concatenated and each column corresponds to pre-processed fluorescence intensity at a 
particular excitation emission wavelength pair. 

These processed data matrices are then used to test the composite screening 
algorithm performance. The steps of this test are: 



-28- 



The normalized prediction data matrix (Dn*) is multiplied by the reduced 
eigenvector matrix from normalized spectral data of the calibration set 
(Cn'). Cn* contains only those eigenvectors which displayed statistically 
significant differences for samples to be classified by constituent 
algorithm 1. 

The posterior probabilities that a sample is SIL or normal squamous 
epithelium are calculated using Bayes theorem. In this calculation, the 
mean values and standard deviations of the PC scores for normal 
squamous epithelium and SILs and prior probabilities and optimal costs of 
misclassification of the calibration set are used. 

The normalized, mean-scaled prediction data matrix (Dnm*) is multiplied 
by the reduced eigenvector matrix from normalized, mean-scaled spectral 
data of the calibration set (Cnm*). CnnT contains only those eigenvectors 
which displayed statistically significant differences for samples to be 
classified by constituent algorithm 2. 

The posterior probabilities that a sample is SIL or normal columnar 
epithelium are calculated using Bayes theorem.. In this calculation, the 
mean values and standard deviations of the PC scores for normal 
columnar epithelium and SILs and prior probabilities and optimal costs of 
misclassification of the calibration set are used. 

Using constituent algorithm 1, samples with a posterior probability of 
being normal squamous epithelium greater than a threshold value are 
classified as non-SIL. Remaining samples are classified based on the 
output of constituent algorithm 2. Using constituent algorithm 2, sample 
with a posterior probability of being normal columnar epithelium greater 
than a threshold are classified as non-SIL. The remaining samples are 
classified as SIL. 
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The processed data matrices are then used to test the composite diagnostic 
algorithm performance. The steps of this test are: 

The normalized prediction data matrix (Dn') is multiplied by the reduced 
eigenvector matrix from normalized spectral data of the calibration set 
(Cn'). Cn' contains only those eigenvectors which displayed statistically 
significant differences for samples to be classified by constituent 
algorithm 1. 

The posterior probabilities that a sample is SIL or normal squamous 
epithelium are calculated using Bayes theorem. In this calculation, the 
mean values and standard deviations of the PC scores for normal 
squamous epithelium and SILsand prior probabilities and optimal costs of 
misclassification of the calibration set are used. 

The normalized, mean-scaled prediction data matrix (Dnm') is multiplied 
by the reduced eigenvector matrix from normalized, mean-scaled spectral 
data of the calibration set (Cnm*). Cnm' contains only those eigenvectors 
which displayed statistically significant differences for samples to be 
classified by constituent algorithm 2. 

The posterior probabilities that a sample is SIL or normal columnar 
epithelium are calculated using Bayes theorem. In this calculation, the 
mean values and standard deviations of the PC scores for normal 
columnar epithelium and SILs and prior probabilities and optimal costs of 
misclassification of the calibration set are used. 

The normalized prediction data matrix (Dn') is multiplied by the reduced 
eigenvector matrix from normalized spectral data of the calibration set 
(Cn'). Cn' contains only those eigenvectors which displayed statistically 
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significant differences for samples to be classified by constituent 
algorithm 3. 

The posterior probabilities that a sample is HGSIL or LGSIL are 
S calculated using Bayes theorem. In this calculation, the mean values and 

standard deviations of the PC scores for HGSILs and LGSILs and prior 
probabilities and optimal costs of misclassification of the calibration set 
are used. 

10 Using constituent algorithm 1, samples with a posterior probability of 

being normal squamous epithelium greater than a threshold are classified 
as non-SIL. Remaining samples are classified based on the output of 
constituent algorithm 2. Using constituent algorithm 2, sample with a 
posterior probability of being normal columnar epithelium greater than a 

IS threshold are classified as non-SIL. Remaining samples are classified 

based on the output of constituent algorithm 3. Using constituent 
algorithm 3, samples with a posterior probability of being LGSIL greater 
than a threshold are classified as LGSIL. The remaining samples are 
classified as HGSIL. 

20 

The calibration and prediction sets were developed by randomly assigning the 
spectral data into the two sets with the condition that both contain roughly equal number 
of samples from each histopathologic category. The random assignment ensured that 
not all spectra from a single patient were contained in the same data set. 



Histo-pathology 


Calibration Set 


Prediction Set 


Normal Squamous 


94 


94 


Normal Columnar 


13 


14 


Inflammation 


IS 


14 


Low Grade SIL 


23 


24 


High Grade SIL 


35 


35 



25 

Table 1. Histopathologic classification of samples from the calibration and prediction 
sets. Note, biopsies for histological evaluation were hot obtained from colposcopically 



WO 97/48331 



PCT/US97/10729 



-31 - 

normal squamous and columnar tissue sites to comply with routine patient care 
procedure. 

Development of constituent algorithms: The multivariate statistical algorithm 
5 was developed and optimized using all 28 types of pre-processed spectral inputs from the 

calibration set. The algorithm was used to identify spectral inputs which provide the 
greatest discrimination between the following pairs of tissue types: (1) SILs and normal 
squamous epithelia, (2) SILs and normal columnar epithelia, (3) SILs and inflammation, 
and (4) high grade SILs and low grade SILs. The optimal spectral input for 
10 differentiating between two particular tissue types was identified when the total number 

of samples misclassified from the calibration set using the multivariate statistical 
algorithm was a minimum. The algorithm based on the spectral input that minimized 
misclassification between the two tissue types under consideration was implemented on 
the prediction data set. 

15 

Three multivariate statistical constituent algorithms were developed using tissue 
spectra at three excitation wavelengths. Constituent algorithm (1) was developed to 
differentiate between SILs and normal squamous epithelia; constituent algorithm (2) was 
developed to differentiate between SILs and normal columnar epithelia and constituent 
20 algorithm (3) could be used to discriminate between low grade SILs and high grade SILs. 

Development of composite algorithms: Each of the independently developed 
constituent algorithms was intended to discriminate only between pairs of tissue types. 
A combination of these constituent algorithms was required to provide discrimination 

25 between several of the clinically relevant tissue types. Therefore, two composite 
algorithms were developed: a composite screening algorithm was developed to 
differentiate between SILs and non SILs (normal squamous and columnar epithelia and 
inflammation) using constituent algorithms (1) and (2) and a composite diagnostic 
algorithm was developed to differentiate high grade SILs from non-high grade SILs (low 

30 grade SILs, normal epithelia and inflammation) using all three constituent algorithms. 
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The composite screening algorithm was developed in the following manner. 
First, constituent algorithms (1) and (2) were developed independently using the 
calibration data set. The classification outputs from both constituent algorithms were 
used to determine if a sample being evaluated is S1L or non-SIL: first, using constituent 
algorithm (1), samples were classified as non SIL if they had a probability that is less 
than 0.5; otherwise, they were classified as SIL. Next, only samples that were classified 
as SIL based on the algorithm (1) were tested using algorithm (2). Again, samples were 
classified as non SIL if their posterior probability was less than 0.5; otherwise they were 
classified as SIL. The spectral data from the prediction set was evaluated using the 
composite screening algorithm in an identical manner. 

The composite diagnostic algorithm was implemented in the following manner. 
The three constituent algorithms were developed independently using the" calibration set. 
Algorithms (1) and (2) were implemented on each sample from the calibration data set. 
as described previously. Only samples that were classified as SIL based on algorithms 
0) and (2) were tested using algorithm (3). If samples evaluated using algorithm t'3) had 
a posterior probability greater than 0.5, they were classified as high grade SIL; otherwise 
they were classified as non-high grade SIL. The spectral data from the prediction set was 
evaluated using the composite diagnostic algorithm in an identical manner. 

EXAMPLE* RESULTS 

Constituent algorithms (J), (2) and (3): Table 2 summarizes the components of 
the optimal set of three constituent algorithms. Constituent algorithm (1) can be used to 
differentiate between SILs and normal squamous epithelia; algorithm (2) differentiates 
between SILs and normal columnar epithelia; and algorithm (3) discriminates between 
low grade SILs and high grade SILs. 
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Pre-processing: FIG. 3A illustrates average fluorescence spectra per site 
, acquired from cervical sites at 337 nm excitation from a typical patient. All fluorescence 
intensities are reported in the same set of calibrated units; Corresponding normalized 
and normalized, mean-scaled spectra are illustrated in FIGS. 3B and 3C, respectively. 
Evaluation of the original spectra at 337 nm excitation (FIG. 3A) indicates that the 
fluorescence intensity of SILs is less than that of the corresponding normal squamous 
tissue and greater than that of the corresponding normal columnar tissue over the entire 
emission spectrum. Examination of normalized spectra from this patient (FIG. 3B) 
indicates that following normalization, the fluorescence intensity of the normal squamous 
tissue is greater than that of corresponding SILs over the wavelength range 360 to 450 
nm only; between 460 and 600 nm, the fluorescence intensity of SILs is greater than that 
of the corresponding normal squamous tissue which in part reflects the longer peak 
emission wavelength of SILs. A comparison of the spectral line shape of SILs to that of 
the normal columnar tissue illustrates the opposite phenomenon. The norma!i7ed 
fluorescence intensity of SILs is greater than that of the a>rresponding normal columnar 
tissue over the wavelength range 360 to 450 nm; however, between 460 and 600 nm, the 
fluorescence intensity of the normal columnar tissue is greater than mat of the SILs. This 
spectral difference reflects the longer peak emission wavelength of the normal columnar 
tissue relative to that of SILs. Further evaluation of normalized spectra in FIG. 3B 
indicates that there are spectral line shape differences between low grade SILs and high 
grade SILs over the wavelength range 360 to 420 nm. 

The corresponding normalized, mean-scaled spectra of this patient, shown in FIG. 
3C displays differences in the normalized fluorescence spectrum (FIG. 3B) from a 
particular site with respect to the average normalized spectrum from this patient. 
Evaluation of FIG. 3C indicates that between 360 and 450 nm, the normalized, mean- 
scaled fluorescence intensity of the normal squamous tissue is greater than the mean 
(Y=0), and that of the normal columnar tissue is less than the mean. Above 460 nm, the 
opposite phenomenon is observed; the fluorescence intensity of the normal squamous 
tissue is less than the mean, while that of the normal columnar tissue is greater than the 
mean. The fluorescence intensity of SILs lies close to the mean and is bounded by the 
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intensities of the two normal tissue types. In addition, between 360 and 420 nm, the 
normalized, mean-scaled fluorescence intensity of the low grade SIL is slightly greater 
than the mean, while that of the high grade SIL is less than the mean. 

FIG. 4A illustrates average fluorescence spectra per site acquired from cervical 
sites at 380 nm excitation, from the same patient. FIGS. 4B and 4C show the 
corresponding normalized, and normalized, mean-scaled spectra, respectively. In FIG. 
4A, the fluorescence intensity of SILs is less than that of the corresponding normal 
squamous tissue, with the low grade SIL exhibiting the weakest fluorescence intensity 
over the entire emission spectrum. Note that the fluorescence intensity of the normal 
columnar sample is indistinguishable from that of the low grade SIL. Normalized spectra 
at 380 nm excitation, (FIG. 4B), indicate that over the wavelength range 400 to 450 nm, 
the. fluorescence intensity of the normal squamous tissue is slightly greater than that of 
SILs and that of the normal columnar tissue is less than that of SILs. The opposite 
phenomenon is observed above 580 nm. A careful examination of the spectra of the low 
grade SIL and high grade SIL indicates that between 460 and 580 nm, the normalized 
fluorescence intensity of the low grade SIL is higher than that of the high grade SIL. The 
normalized, mean-scaled spectra (FIG. 4C) enhances the previously observed normalized 
spectral line shape differences by displaying theni relative to the average normalized 
spectrum of this patient. FIG. 4C indicates that between 400 to 450 nm, the fluorescence 
intensity of the normal squamous tissue is greater than the mean and that of the normal 
columnar tissue is less than the mean. The opposite phenomenon is observed above 460 
nm. The fluorescence intensity of the SILs is bounded by the intensities of the two 
normal tissue types over the entire emission spectrum. The low grade SIL and high grade 
SIL also show spectral line shape differences; above 460 nm, the normalized, mean- 
scaled fluorescence intensity of the low grade SIL lies above the mean and that of the 
high grade SIL lies below the mean. 

FIGS. 5A, 5B and 5C illustrate original, normalized and normalized, mean-scaled 
spectra, respectively at 460 nm excitation from the same patient. Evaluation of FIG. 5A 
indicates that the fluorescence intensity of SILs is less than that of the corresponding 
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normal squamous tissue and greater than that of the corresponding normal columnar 
sample over the entire emission spectrum. Evaluation of normalized spectra at this 
excitation wavelength (FIG. 5B) demonstrates that below 510 nm, the fluorescence 
intensity of SILs is less than that of the normal squamous tissue and greater than that of 
5 the corresponding normal columnar tissue. Above, 580 nm, the normalized fluorescence 
intensity of SILs is less than that of the normal columnar tissue and greater then that of 
normal squamous tissue. Note that there arc spectral line shape differences between the 
low grade SIL and high grade SIL between 580 and 660 nm; the normalized fluorescence 
intensity of the low grade SIL is greater than that of the high grade SIL. The normalized, 
10 mean-scaled spectra shown in FIG. 5C reflects the differences observed in the 
normalized spectra relative to the average normalized spectrum of this patient. Below 
510 nm, the fluorescence intensity of the normal squamous tissue is greater than the 
mean, while that of the normal columnar tissue is less than the mean. Above 580 nnu the 
opposite phenomenon is observed. The fluorescence intensity of the SILs lies between 
15 those of the two normal tissue types. Above 580 nm, the fluorescence intensity of the 
low grade SIL is greater than the mean and that of the high grade SIL is less than the 
mean. 

Principal Component Analysis and Logistic Discrimination: Constituent 
algorithm (1) which differentiates SILs from normal squamous tissues. A constituent 
algorithm based on normalized spectra arranged in series at all three excitation 
wavelengths provided the greatest discrimination between SILs and normal squamous 
tissues. The algorithm demonstrated an incremental improvement in sensitivity without 
sacrificing specificity relative to the previously developed constituent algorithm (1) that 
employed normalized, mean-scaled spectra at 337 nm excitation only. Multivariate 
statistical analysis of normalized tissue spectra at all three excitation wavelengths, 
indicated three principal components show statistically significant differences between 
SILs and normal squamous tissues (Table 2). These three principal components account 
collectively for 65% of the total variance of the spectral data set Logistic discrimination 
was used to develop a classification algorithm to discriminate between SILs and normal 
squamous epithelia based on these three informative principal components. Prior 
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probabilities were determined by calculating the percentage of each tissue type from the 
data set: 62% normal squamous tissues and 38% SILs. The cost of misclassification of 
SIL was optimized at 0.7. Posterior probabilities of belonging to each tissue type were 
calculated for all samples from the data set, using the known prior probabilities, cost of 
misclassification of SILs and the conditional joint probabilities calculated from the 
normal probability density function. FIG. 6 illustrates the retrospective accuracy of the 
algorithm applied to the calibration data set. The posterior probability of being classified 
into the SIL category is plotted for all SILs and normal squamous epithelia. FIG. 6 
indicates that 92% of high grade SILs and 83% of low grade SILs are correctly classified 
with a posterior probability greater than 0.5. Approximately 70% of colposcopically 
normal squamous epithelia are correctly classified with a posterior probability less than 
0.5. 

The confusion matrix in Table 3 compares the retrospective accuracy of the 
algorithm on the calibration data set to its prospective accuracy on the prediction set. In 
the confusion matrix, the first row corresponds to the histo-pathologic classification and 
the first column corresponds to the spectroscopic classification of the samples. A 
prospective evaluation of the algorithm's accuracy indicates thai there is a small increase 
in the proportion of correctly classified low grade SILs and no change in the proportion 
of correctly classified low grade SILs or rornial squamous tissues. Note that the majority 
of normal columnar tissues and samples with inflammation from both calibration and 
prediction sets are misclassified as SIL using this algorithm. Evaluation of the 
misclassified SILs from the calibration set indicates that one sample with CIN III, two 
with CIN II, two with CIN I and two with HPV are incorrectly classified. From the 
prediction set, two samples with CIN III, one with CIN II, two with CIN I and one with 
HPV are incorrectly classified as non-SIL. 
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68% 
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21% 


12% 


9% 


SIL 


32% 


71% 


79% 


88% 


91% 



Table 3. Accuracy of constituent algorithm (1) which differentiates SILs and normal 
squamous tissues from the calibration and prediction sets. Hie first column corresponds 
S to the spectroscopic classification and the first row corresponds to the histopathologic 
classification. 

Constituent algorithm (2) which differentiates SILs from normal columnar 
tissues. The greatest discrimination between SILs and normal columuar epithelia was 

10 achieved using a constituent algorithm based on normalized, mean-scaled spectra at all 
three excitation wavelengths. This algorithm demonstrated a substantially improved 
sensitivity for a similar specificity relative to the previously developed constituent 
algorithm (2) which used normalized, mean-scaled spectra at 380 mn excitation, only. 
Multivariate statistical analysis of a combination of normalized, mean-scaled tissue 

15 spectra at all. three excitation wavelengths resulted in four principal components that 
demonstrate statistically significant differences between SILs and normal columnar 
epithelia (Table 2). These four principal components collectively account for 80% of the 
total variance of the spectral data set Logistic discrimination was employed to develop a 
classification algorithm to discriminate between SILs and normal columnar epithelia. 

20 The prior probabilities were determined to be: 28% normal columnar tissues and 72% 
SILs. The optimized cost of misclassification of SIL was equal to 0.58. Posterior 
probabilities of belonging to each tissue type were calculated for all samples from the 
data set. FIG. 7 illustrates the retrospective accuracy of the algorithm applied to the 
calibration data set. The posterior probability of being classified into the SIL category is 

25 plotted for all SILs and normal columnar samples examined. FIG. 7 graphically 
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indicates that 91% of high grade SILs and 83% of low grade SILs have a posterior 
probability that is greater than 0.5. 76% of colposcopically norma! columnar epithelia are 
correctly classified with a posterior probability less than 0.5. 



The confusion matrix in Table 4 compares the retrospective accuracy of the 
constituent algorithm on the calibration data set to its prospective accuracy on the 
prediction set. The prospective accuracy of the algorithm (Table 4) indicates that there is 
a small increase in the proportion of correctly classified low grade SILs and a small 
decrease in the proportion of correctly classified high grade SILs; there is approximately 
a 10% decrease in the proportion of correctly classified normal columnar tissues. Note 
that the majority of normal squamous tissues and samples with inflammation from both 
the calibration and prediction sets are misclassified as SIL using this algorithm. 
Evaluation of the misclassified SILs from the calibration set indicates that three samples 
with CIN II, three with CIN I and one with HPV are incorrectly classified. From the 
prediction set, two samples with CIN III, three with CIN II, and three with CIN I are 
incorrectly classified. 
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Table 4. Accuracy of constituent algorithm (2) which differentiates SILs and normal 
column* tissues from the calibration and prediction sets. The first column corresponds 
dassS^° SC ° P1C aSSif,Cation 3,1(1 ** flTsi row ooraponds t0 me histo-pathologic 

Constituent algorithm (3) which differentiates High Grade SILs and Low Grade 
SILs. A combination of normalized spectra at all three excitation wavelengths 
significantly enhanced the accuracy of the previously developed constituent algorithm (3) 
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which differentiated high SILs from low grade SILs using normalized spectra at 460 nm 
excitation. Multivariate statistical analysis of normalized spectra at all three excitation 
wavelengths resulted in four statistically significant principal components, that account 
collectively for 67% of the total variance of the spectral data set (Table 2). Again, a 
5 probability based classification algorithm was developed to differentiate high grade SILs 
from low grade SILs. The prior probability was: 40% low grade SILs and 60% high 
grade SILs. The optimal cost of misclassificatibn of high grade SIL was equal to 0.51. 
Posterior probabilities of belonging to each tissue type were calculated. FIG. 8 illustrates 
the retrospective accuracy of the algorithm applied to the calibration data set. The 
10 posterior probability of being classified into the high grade SIL category is plotted for all 
SILs evaluated. FIG. 8 indicates that 83% of high grade SILs have a posterior probability 
greater than 0.5, and 70% of low grade SILs have a posterior probability less than 0.5. 

The confusion matrix in Table 5 compares the retrospective accuracy of the 
15 constituent algorithm on the calibration set to its prospective accuracy on the prediction 
set. Its prospective accuracy indicates that there is a 5% decrease in the proportion of 
correctly classified low grade SILs and no change in the proportion of correctly classified 
high grade SILs. From the calibration set, six high grade SILs are misclassified; three 
samples with CIN III and three with CIN II are misclassified as low grade SIL. The 
20 misclassified low grade SILs comprise of five samples with CIN I and two with HPV. 
From the prediction set, five high grade SILs are misclassified; two have CIN III and 
three have CIN II. Of the ten misclassified low grade SILs from the prediction set, seven 
have CIN I and three have HPV. 
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Table 5. Accuracy of constituent algorithm (3) which differentiates high grade SILs and 
low grade SILs from the calibration and prediction sets. The first column corresponds to 
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the spectroscopic classification and the first row corresponds to the histopathologic 
classification. 

"Full-parameter" composite screening and diagnostic algorithms: A composite 
screening algorithm was developed to differentiate SILs and non-SILs (normal squamous 
and columnar epithelia and inflammation) and a composite diagnostic algorithm was 
developed to differentiate high grade SILs from non-high grade SILs (low grade SILs, 
normal epithelia and inflammation). The effective accuracy of both composite 
algorithms were compared to those of the constituent algorithms from which they were 
developed and to the accuracy of current detection modalities [5,9J. 

A composite screening algorithm which discriminates between SILs and non 
SILs: A composite screening algorithm to differentiate SILs from non-SILs was 
developed using a combination of the two constituent algorithms: algorithm (1) which 
differentiates SILs from normal squamous tissues and algorithm (2) which differentiates 
SILs from normal columnar epithelia. The optimal cost of misclassification of SIL was 
equal to 0.66 for constituent algorithm (1) and 6.64 for constituent algorithm (2). Only 
the costs of misclassification of SIL of the two constituent algorithms was altered for the 
development of the composite screening algorithm. These costs were selected to 
minimize the total number of misclassified samples. 

The accuracy of the composite screening algorithm on the calibration and 
prediction data sets is illustrated in the confusion matrix in Table 6. Examination of the 
confusion matrix indicates that the algorithm correctly classifies approximately 90% of 
high grade SILs and 75% of low grade SILs from the calibration data set. Furthermore, 
approximately, 80% of normal squamous tissues and 70% of normal columnar epithelia 
from the calibration set are correctly classified. Evaluation of the prediction set indicates 
that there is a small change in the proportion of correctly classified high grade SILs and 
low grade SILs. There is a negligible change in the correct classification of normal 
squamous and columnar tissues. Note that while 80% of samples with inflammation 
from the calibration set are incorrectly classified as SIL, only 43% of these samples from 
the prediction set are incorrectly classified. 
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Table 6. Accuracy of the original composite algorithm which differentiates SILs and non 
SILs from the calibration and prediction sets. The first column corresponds to the 
spectroscopic classification and the first row corresponds to the histo-pathologic 
classification. 

A comparison of the accuracy of the composite screening algorithm (Table 6) to 
that of each of the constituent algorithms (1) (Table 3) and (2) (Table 4* on the same 
spectral data set indicates that in general, there is less than a 10% decrease in the 
proportion of correctly classified SILs using the composite screening algorithm relative 
to using either of the constituent algorithms independently. Note, however, that the 
proportion of correctly classified normal (squamous and columnar) epithelia is 
substantially higher using the composite algorithm relative to using either of the 
constituent algorithms independently. These results confirm that utilization of a 
combination of the two constituent algorithms, significantly reduces the false-positive 
rate relative to that using each algorithm independently. Evaluation of the 
spectroscopically misclassified SILs from the calibration set (Table 6) indicates that only 
one sample with CIN III, three with CIN II, two with CIN 1 and four with HPV are 
incorrectly classified. From the prediction data set (Table 6), two samples with CIN III, 
four with CIN II, three with CIN I and one sample with HPV are incorrectly classified. 



25 



A composite diagnostic algorithm which differentiates High Grade SILs from 
non-High Grade SILs: A composite diagnostic algorithm which differentially detects 
high grade SILs was developed using a combination of all three constituent algorithms: 
algorithm (1) which differentiates SILs. from normal squamous tissues, algorithm (2) 
which differentiates SILs from normal columnar epithelia and algorithm (3) which 
differentiates high grade SILs from low grade SILs. The optimal costs of 
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misclassification of SIL was equal to 0:87 for algorithm (1) and 0.65 for algorithm (2); 
the optimal cost of misclassiflcation of high grade SIL was equal to 0.49 for algorithm 
(3). Only the costs of misclassiflcation of SIL of constituent algorithms (1) and (2) and 
the cost of misclassiflcation of high grade SIL of constituent algorithm (3) were altered 
during development of the composite diagnostic algorithm. These costs were selected to 
minimize the total number of misclassified samples. 

The results of the composite diagnostic algorithm on the calibration and 
prediction sets are shown in the confusion matrix in Table 7. The algorithm correctly 
classifies 80% of high grade SILs, 74% of low grade SILs and more than 80% of normal 
epithelia. Evaluation of the prediction set using this composite algorithm indicates that 
there is only a 3% decrease in the proportion of correctly classified high grade SILs and 
a 7% decrease in the proportion of correctly classified low grade SILs. There is less than 
a 10% decrease in the proportion of correctly classified normal epithelia. A comparison 
between the calibration and prediction sets indicates that while more than 70% of 
samples with inflammation from the calibration data set are incorrectly classified as high 
grade SIL, only 14% of samples with inflammation from the prediction set are 
incorrectly identified. Due to the relatively small number of samples examined in this 
histo-pathologic category, the results presented here do not conclusively establish if the 
algorithm is capable of correctly identifying inflammation. 
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Table 7. Accuracy of the original composite screening algorithm which discriminates 
between high grade SILs and non-high grade SILs from the calibration and prediction 
sets. The first column corresponds to the spectroscopic classification and the first row 
corresponds to the histo-pathologic classification. 
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A comparison of the accuracy of the composite diagnostic algorithm to that of 
constituent algorithm (3) which differentiates high grade SILs from low grade SILs 
(Table 5) indicates there is less than a 5% decrease in the proportion of correctly 
classified high grade SILs and a 5% increase in the proportion of correctly classified low 
grade SILs using the composite diagnostic algorithm relative to using the constituent 
algorithm (3). Evaluation of the high grade SILs from the calibration set (Table 7) that 
were incorrectly classified indicates that three samples with CIN III and four with CIN II 
are incorrectly classified. From the prediction set, four, samples with CIN III and five 
with CIN II are incorrectly classified. 

"Reduced-parameter" composite screening and diagnostic algorithms: 
Component Loadings. A component loading represents the correlation between each 
principal component and the original pre-processed fluorescence emission spectra at a 
particular excitation wavelength. FIGS. 9A, 9B and 9C illustrate component loadings of 
the diagnostically relevant principal components of constituent algorithm (1) obtained 
from normalized spectra at 337, 380 and 460 run excitation, respectively. FIGS. 10A, 
10B and 10C display component loadings that correspond to the diagnostically relevant 
principal components of constituent algorithm -(2) obtained from normalized, mean- 
scaled spectra at 337, 380 and 460 nm excitation, respectively. Finally, FIGS. 1 1 A, 1 IB 
and 11C display the component loadings corresponding to the diagnostically relevant 
principal components of constituent algorithm (3), obtained from normalized spectra at 
337, 380 and 460 nm excitation, respectively. In each graph shown, the abscissa 
corresponds to the emission wavelength range at a particular excitation wavelength and 
the ordinate corresponds to the correlation coefficient of the component loading. 
Correlation coefficients of the component loading above 0.5 and below -0.5 are 
considered to be significant. 

FIGS. 9A, 1 OA and 1 1 A display component loadings of principal components of 
constituent algorithms (1), (2) and (3), respectively, obtained from pre-processed spectra 
at 337 nm excitation. A closer examination indicates that component loading 1 is nearly 
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identical for all three algorithms. Evaluation of this loading indicates that it is positively 
correlated with corresponding emission spectra over the wavelength range 360-440 nm 
and negatively correlated with corresponding emission spectra over the wavelength range 
460-660 nm. All remaining principal components of all three algorithms display a 
5 correlation between -0.5 and 0.5, except component loading 4 of algorithm (2) (FIG. 
10A) which displays a positive correlation of 0.75 with the corresponding emission 
spectra at 460 nm. 

Figures 9B, 10B and 11B display component loadings that correspond to the 
10 diagnostically relevant principal components of constituent algorithms (1), (2) and (3), 
respectively obtained from pre-processed spectra at 380 nm excitation. Component 
loading 1 of all three algorithms is positively correlated with corresponding emission 
spectra over the wavelength range, 400-450 nm. Between 500-600 nm, only component 
loading 1 of algorithm (2) (FIG. 10B) is correlated negatively with corresponding 
15 emission spectra. However, examination of component loading 3 of algorithm (1) (FIG. 
9B) and algorithm (3) (FIG. 1 IB) indicates that they are also negatively correlated with 
corresponding emission spectra from 500-600 nm. Only component loading 2 of 
algorithm (2) (FIG. 10B) is positively correlated with corresponding emission spectra 
from 500-600 nm. Also note that component loading 3 of algorithm (1) (FIG. 9B) and 
20 component loadings 3 and 6 of algorithm (3) (FIG. 11B) display a positive correlation 
with corresponding emission spectra at approximately 640 nm. 

FIGS. 9C, 10C and 11C display component loadings that correspond to the 
diagnostic principal components of constituent algorithms (1), (2) and (3), respectively 
25 obtained from pre-processed spectra at 460 nm excitation. Note that only component 
loading 1 displays a negative correlation (< -0.5) with corresponding emission spectra for 
all three algorithms. This component loading is correlated with corresponding emission 
spectra over the wavelength range 580-660 nm. The remaining principal components of 
all three algorithms display a correlation between -0.5 and 0.5. 
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The component loadings at all three excitation wavelengths of all three 
constituent algorithms were evaluated to select fluorescence intensities at a minimum 
number of excitation-emission wavelength pairs required for the previously developed 
constituent and composite algorithms to perform with a minimal decrease in 
5 classification accuracy. Portions of the component loadings of the three constituent 
algorithms most highly correlated (correlation > 0.5 or < -0.5) with corresponding 
emission spectra at each excitation wavelength were selected and the reduced data matrix 
was then used to regenerate and evaluate the constituent and composite algorithms. It 
was iteratively determined that fluorescence intensities at a minimum of 15 excitation- 

10 emission wavelength pairs are required to re-develop constituent and composite 
algorithms that demonstrate a minimum decrease in classification accuracy. At 337 nm 
excitation, fluorescence intensities at two emission wavelengths between 360-450 nm 
and intensities at two emission wavelengths between 460-660 nm were selected. At 380 
nm excitation, intensities at two emission wavelengths between 400-450 nm and 

1 5 intensities at four emission wavelengths between 500-640 nm were selected Finally, at 
460 nm excitation, fluorescence intensities at five emission wavelengths over the range 
580-660 nm was selected. Table 8a lists these excitation-emission wavelength pairs for 
each of the three constituent algorithms, (1), (2) and (3). These excitation-emission 
wavelength pairs are also indicated on the component loading plots in FIGS. 9-1 1. The 

20 bandwidth at each emission wavelength is 10 nm; 
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Algorithm (1) 


Algorithm (2) 


Algorithm (3) 


(*exc» ^emm) 


(^exc> ^emm) 




337,410nra 


337, 410 nm 


317 410 nm 


337, 430 nm 


337, 430 nm 


337 430 nm 


337, 460 nm 


337, 460 nm 


337 460 nm 


337, 510 nm 


337, 510 nm 


337 ^ 1 A nm 


337, 580 nm 


337 580 nm 


337 

j j /, 3oU nm 


380, 410 nm 


380, 410 nm 


->ois s *r i u nm 


380, 430 nm 


380. 430 nm 


jou, *ou nm 


380, 460 nm 


380 460 nm 


jou, *tou nm 


380, 510 nm 


380. 510 nm 

W, A W lull 


jou, j iv nm 


380, 580 nm 


380, 580 nm 


380, 580 nm 


380, 640 nm 


380, 600 nm 


380. 640 nm 


460, 510 nm 


460, 510 nm 


460, 510 nm 


460, 580 nm 


460, 580 nm 


460, 580 nm 


460, 600 nm 


460, 600 nm 


460, 600 nm 


460, 620 nm 


460, 620 nm 


460, 620 nm 


460, 640 nm 


460, 660 nm 


460, 640 nm 



Table 8a. Fluorescence intensities at 18 excitation-emission wavelength pairs needed to 
re-develop the three constituent algorithms (1), (2) and (3) with a minimal decrease in 
classification accuracy. 



Reduced-parameter composite algorithms: Using the fluorescence intensities 
only at the selected excitation-emission wavelength pairs, the three constituent 
algorithms were re-developed using the same formal analytical process as was done 
previously using the entire fluorescence emission spectra at all three excitation 
wavelengths (FIG. 2). The three constituent algorithms were then independently 
optimized using the calibration set and tested prospectively on the prediction data set. 
They were combined as described previously into composite screening and diagnostic 
algorithms. The effective accuracy of these reduced-parameter composite algorithms 
were compared to that of the full-parameter composite algorithms developed previously 
using fluorescence emission spectra at , all three excitation wavelengths. 
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Excitation, Emission 


Old Bandwidth (nm) 


New Bandwidth fnirri 


337 nm, 410 nm 


10 


80 


337 nm, 430 nm 


10 


Eliminated 


337 nm, 460 nm 


10 


20 


337 nm, 510 nm 


10 


60 


337 nm, 580 nm 


10 




380 nm, 410nm 


10 




380 nm, 430 nm 


10 


1* 1 1 m i n otoM 


380nm, 510nm 


10 : 


ou 


380 nm, 460 nm 


10 




380 nm, 580 nm 


10 


IU 


380 nm, 600 nm 


10 


10 


380 nm, 640 nm 


10 


10 


460nm,510nm 


10 


10 


460nm, 580nm 


10 


10 


460 nm, 600 nm 


10 


10 


460 nm, 620 nm 


10 


10 


460 nm, 640 nm 


10 


10 


460 nm, 660 nm 


10 


10 



10 



15 



Table 8b. Fluorescence intensities al 15 excitation-emission wavelength pairs needed to 
re-develop the three constituent algorithms (1), (2) and (3) with a minimal decrease in 
classification accuracy. 

Table 8b contains fluorescence intensities at 15 of the previous 18 excitation-emission 
wavelength pairs needed to redevelop the three constituent algorithms with a minimal 
decrease in classification accuracy. This table indicates that three variables are 
eliminated and the bandwidths of intensities at four excitation-emission wavelength pairs 
are increased by approximately a factor of four. These results establish that a further 
reduction in the number of emission variables and an increase in bandwidth minimally 
affect the classification accuracy of the algorithms. The benefit of eliminating the three 
emission variables and increasing the bandwidth of four emission variables is that it can 
reduce the total integration time needed to measure the fluorescence parameters from the 
tissue. 
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Table 9 displays the accuracy ■ of the reduced-parameter composite screening 
algorithm (based on fluorescence intensities at 15 excitation-emission wavelength pairs) 
which discriminates between SILs and non-SlLs applied to the calibration and prediction 
sets. A comparison between the calibration and prediction data sets indicates that there is 
5 less than a 10% decrease in the proportion of correctly classified SILs and normal 
squamous tissues from the prediction set Note however that there is a 20% increase in 
the proportion of correctly classified normal columnar epithelia and a 40% increase in 
the proportion of correctly classified samples with inflammation from the prediction set. 



Classification in 


Normal 


Normal 


Inflammation 


LG SIL 


HG SIL 


Calibration Set 


Squamous 


Columnar 








Non SIL 


73% 


46% 


13% 


17% 


15% 


S1L 


27% 


54% 


87% 


83% 


85% 


Classification in 


Normal 


Normal 


Inflammation 


LG SIL 


HG SIL 


Prediction Set 


Squamous 


Columnar 








Non SIL 


72% 


64% 


50% 


25% 


11% 


SIL 


28% 


36% 


50% 


75% 


89% 



10 

Table 9. Accuracy of the reduced composite screening algorithm which differentiates 
SILs and non-SILs from the calibration and prediction sets. The first column corresponds 
to the spectroscopic classification and the first row corresponds to the histopathologic 
classification. 

15 

The accuracy of the reduced-parameter composite screening algorithm (Table 9) 
was compared to that of the full-parameter composite screening algorithm (Table 6) 
applied to the same spectral data set. A comparison indicates that in genera] there is less 
20 than a 10% decrease in the accuracy of the reduced-parameter composite algorithm 
relative to that of the full-parameter composite screening algorithm, except for a 20% 
decrease in the proportion of correctly classified normal columnar epithelia from the 
calibration set tested using the reduced-parameter composite screening algorithm (Table 
9). 

25 

Table 10 displays the accuracy, of the reduced-parameter composite diagnostic 
algorithm that differentially identifies high grade SILs from the calibration and 
prediction sets. A comparison of sample classification between the calibration and 
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prediction data sets indicates that there is negligible change in the proportion of correctly 
classified high grade, low grade SILs and normal squamous epithelia. Note that there is 
approximately a 20% increase in the proportion of correctly classified normal columnar 
epithelia and samples with inflammation from the prediction set. 



Classification in 
Calibration Set 


Normal 
Squamous 


Normal 
Columnar 


Inflammation 


LG SIL 


HGSIL 


Non HG SIL 


79% 


62% 


40%. 


65% 


23% 


HGSIL 


21% 


38% 


60% 


35% 


77% 


Classification in 
Prediction Set 


Normal 
Squamous 


Normal 
Columnar 


Inflammation 


LGSIL 


HGSIL 


Non HGSIL 


82% 


86% 


64% 


63% 


20% 


HGSIL 


18% 


14% 


36% 


37% 


80% 



Table 10. Accuracy of reduced composite diagnostic algorithm which differentiates high 
grade SILs from non-high grade SILs from the calibration and prediction sets. The first 
column corresponds to the spectroscopic classification and the first row corresponds to 
the histo-pathologic classification. 

A comparison of the composite diagnostic algorithm based on the reduced 
emission variables (Table 10) to that using fluorescence emission spectra at all three 
excitation wavelengths (Table 7) applied to the same spectral data set indicates that in 
general, the accuracy of the reduced-parameter composite diagnostic algorithm is within 
10% of that reported for the full-parameter composite diagnostic algorithm. However, a 
comparison between Tables 7 and 10 indicates that there is approximately a 15% 
decrease and a 20% increase in the proportion of correctly classified normal columnar 
epithelia from the calibration and prediction sets (Table 10), respectively which were 
tested using the reduced-parameter composite diagnostic algorithm. The opposite trend 
is observed for samples with inflammation tested using the reduced-parameter composite 
diagnostic algorithm (Table 10). 



Table 11 compares the sensitivity and specificity of the full-parameter and 
reduced-parameter composite algorithms to that of Pap smear screening [5] and 
colposcopy in expert hands [9]. Table 11 indicates that the composite screening 
algorithms have a similar specificity and a significantly improved sensitivity relative to 
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Pap smear screening [5], A comparison of the sensitivity of the composite screening 
algorithms to that of colposcopy in expert hands for differentiating SILs from non SILs 
indicates that these algorithms demonstrate a 10% decrease in sensitivity, but a 20% 
improvement in specificity. The composite diagnostic algorithms and colposcopy in 
expert hands discriminate high grade SILs from non-high grade SILs with a very similar 
sensitivity and specificity. A comparison between the full-parameter and reduced- 
parameter composite algorithms indicates that the algorithms based on the reduced 
emission variables demonstrate a similar classification accuracy relative to those that 
employ fluorescence emission spectra at all three excitation wavelengths. 





SILs vs. NON SILs 


HG SILvs. Non HG SIL 


Classification 


Sensitivity 


Specificity 


Sensitivity 


Specificity 


Pap Smear 


62%±23 


68%±21 


N/A 


N/A 


Colposcopy in Expert Hands 


94%±6 


48%±23 


79%±23 


76%±13 


Original Composite Algorithm 


82%±1.4 


68%±0.0 


79%±2 


78%±6 


Reduced Composite Algorithm 


84%+ 1.5 


LJ>5%±2 


78%±0.7 


74%±2 



Table 11. Comparison of accuracy of composite screening and diagnostic algorithms to 
that of Pap smear screening and colposcopy in expert hands. 

CLINICAL METHODS 



In a clinical setting, the following exemplary steps are carried out to perform the 
composite screening algorithm in accordance with the present invention: 

The instrument (FIG. 1) is turned on and calibrated. Next, the prior 
probability that the patient to be measured has SIL is entered. This probability 
may be derived from statistics from the general population, or may be derived 
from patient-specific data collected, for example, from a prior colposcopy. Next, 
a speculum is inserted and the cervix is observed. Acetic acid may be applied to 
the cervix, if desired. 

The probe is directed to the cervix, ensuring that areas desired for 
screening will be illuminated. Multiple placements of the probe may be 
necessary. Using the probe, the cervix is illuminated with excitation at 
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approximately 337 nm, 380 nm and 460 nm. The probe will record resulting 
fluorescence data. 

Data from each spatial location assessed is analyzed to indicate whether 
5 the tissue is SIL or not. Analysis steps carried but include: 

1. Data recorded from each spatial location on the cervix is pre- 
processed in two ways: normalization, and normalization followed by 
mean scaling. Similarly pre-processed data obtained at each 

10 excitation wavelength are concatenated into a vector for each spatial 

location assessed. 

2. The normalized data vector from each site (Dn*) is multiplied by the 
reduced eigenvector matrix stored in memory (Cn*). Cn* contained 

15 only those eigenvectors which displayed statistically significant 

differences for samples to be classified by constituent algorithm 1. 

3. The posterior probabilities that a sample is SIL or normal squamous 
epithelium are calculated using Bayes theorem. In this calculation, the 

20 mean values and standard deviations of the PC scores for normal 

squamous epithelium and SILs and optimal costs of misclassifi cation 
stored in memory and the entered prior probability are used. 

4. The normalized, mean-scaled prediction data vector (Dnm*) is 
25 multiplied by the reduced eigenvector matrix from normalized, mean- 
scaled spectral data stored in memory (Cnm'). Cnm* contains only 
those eigenvectors which displayed statistically significant differences 
for samples to be classified by constituent algorithm 2. 

30 5. The posterior probabilities that a sample is SIL or normal columnar 

epithelium are calculated using Bayes theorem. In this calculation, the 
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mean values and standard deviations of the PC scores for normal 
columnar epithelium and SILs and optimal costs of misclassification 
stored in memory and entered prior probabilities are used. 

6. Using constituent algorithm 1, sites with a posterior probability of 
being normal squamous epithelium greater than a threshold value are 
classified as non-SIL. Remaining sites are classified based on the 
output of constituent algorithm 2. . Using constituent algorithm 2, 
sample with a posterior probability of being normal columnar 
epithelium greater than a threshold are classified as non-SIL. The 
remaining samples are classified as SIL. These tissue classifications 
may then be displayed in an easily understandable way, for example, 
by displaying an image of the cervix with the different tissue types 
displayed as different colors. 

To use the composite diagnostic algorithm of the present invention in clinical 
practice, the following exemplary steps arc carried out: 

The instrument (FIG. 1) is turned on and calibrated. The prior probability 
that the patient to be measured has SIL and HGSIL is entered. Once again, this 
probability may be derived from statistics from the general population, or may be 
derived from patient-specific data collected, for example, from a prior 
colposcopy. Next, a speculum is inserted and the cervix is observed. Acetic acid 
may be applied to the cervix, if desired. 

The probe is directed to the cervix, ensuring that areas desired for 
screening will be illuminated. Multiple placements of the probe may be 
necessary. Using the probe, the cervix is illuminated with excitation at 
approximately 337 nm, 380 nm and 460 nm. The probe will record resulting 
fluorescence data. 
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Data from each spatial location assessed is analyzed to indicate whether 
the tissue is HGSIL or not. Analysis steps carried out include: 

1. Data recorded from each spatial location on the cervix is pre- 
5 processed in two ways: normalization, and normalization followed by 

mean scaling. Similarly pre-processed data obtained at each 
excitation wavelength are concatenated into a vector for each spatial 
location assessed. 

2. The normalized data vector from each site (Dn*) is multiplied by the 
reduced eigenvector matrix stored in memory (Cn'). Cn* contained 
only those eigenvectors which displayed statistically significant 
differences for samples to be classified by constituent algorithm 1. 

3. The posterior probabilities that a sample is SIL or normal squamous 
epithelium are calculated using Hayes theorem. In this calculation, the 
mean values and standard deviations of the PC scores for normal 
squamous epithelium and SILs and optimal costs of misclassification 
stored in memory and the entered prior probability are used. 

4. The normalized, mean-scaled prediction data vector (Dnm*) is 
multiplied by the reduced eigenvector matrix from normalized, mean- 
scaled spectral data stored in memory (Cnnv). Cnm' contains only 
those eigenvectors which displayed statistically significant differences 
for samples to be classified by constituent algorithm 2. 

5. The posterior probabilities that a sample is SIL or normal columnar 
epithelium are calculated using Bayes theorem. In this calculation, the 
mean values and standard deviations of the PC scores for normal 

30 columnar epithelium and SILs and optimal costs of misclassification 

stored in memory and entered prior probabilities are used. 



10 



15 



20 



25 
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6. The normalized prediction data vector (Dn') is multiplied by the 
reduced eigenvector matrix from normalized spectral data of the 
calibration set (Cn'). Cn' contains only those eigenvectors which 
displayed statistically significant differences for samples to be 
classified by constituent algorithm 3. 

7. The posterior probabilities that a sample HGSIL or LGSIL are 
calculated using Bayes theorem. In this calculation, the mean values 
and standard deviations of the PC scores for HGSILs and LGSILs and 
optimal costs of misclassification stored in memory and entered prior 
probabilities are used. 

8. Using constituent algorithm 1, sample with a posterior probability of 
being normal squamous epithelium greater than a threshold are 
classified as non-SIL. Remaining samples are classified based on the 
output of constituent algorithm 2. Using constituent algorithm 2, 
sample with a posterior probability of being normal columnar 
epithelium greater than a threshold are classified as non-SIL. 
Remaining samples are classified based oh the output of constituent 
algorithm 3. Using constituent algorithm 3, samples with a posterior 
probability of being LGSIL greater than a threshold are classified as 
LGSIL. The remaining samples are classified as HGSIL. These tissue 
classifications may then be displayed in an easily understandable way, 
for example, by displaying an image of the cervix with the different 
tissue types displayed as different colors. 

The present invention has been described with reference to particular exemplary 
embodiments. However it will be understood by those of ordinary skill in this 
technology that additions, deletions and changes may be made to the exemplary 
embodiments without departing from the scope of the present invention. 
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APPENDIX I: SPECIFICITY AND SENSITIVITY 



Summarized from: Albert A., Harris E.K.: Multivariate Interpretation of Clinical 
Laboratory Data, Marcel Dekker Inc., New York, pp. 75-82, (1987), the disclosure of 
which is expressly incorporated herein by reference. 

Assuming a group of T samples which can be categorized as normal (N samples) 
or diseased (D samples). A diagnostic test, designed to determine whether the sample is 
normal or diseased, is applied to each sample. The results of the tests is the continuous 
variable x, which is then used to determine the sample type. Figure 22 illustrates a 
hypothetical distribution of test values for each sample type. A diagnostic method based 
on this test can easily be defined by choosing a cutoff point, d, such that a sample with an 
observed value x<d is diagnosed as normal and a sample with an observed value x£d is 
diagnosed as abnonnal. 

Several quantitative measures have been defined to evaluate 1 rhe performance of 
this type of method. The first type evaluates the test itself (i.e. measures the ability of the 
test to separate the two populations, N and D). Sensitivity and specificity are two such 
measures. The second type is designed to aid in the interpretation of a particular test 
result (i.e. deciding whether the individual test measurement has come from a normal or 
diseased sample). Positive and negative predictive value are two measures of this type. 

To define these measures, some terminology and notation must be introduced. 
Referring to Table 12, a sample to be tested can be either normal or diseased; the result 
of the test for each type of sample can be either negative or positive. True negatives 
represent those normal with a positive test result. In these cases, the diagnosis based on 
the rest result is correct. False positives are those normal samples which have a positive 
test result and false negatives are those diseased samples which have a negative test 
result In these cases, the diagnosis based on the test result is incorrect. 
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TABLE 12 





Normal 


Diseased 


Total Samples 


I Test Negative 
(x<d) 


True Negatives 
(TN) 


False Negatives 
(FN) 


Negatives (Neg) 


| Test Positive 
<x*J) 


False Positives 
(FP) 


True Positives 
(TP) 


Positives (Pos) 


1 Total Samples 


N 


D 


T 



With this terminology, Table 13 contains a definition of sensitivity and 
specificity, the two measures which assess the performance of the diagnostic method. 
Specificity is the proportion of normal samples with a negative test result (proportion of 
normal samples diagnosed correctly). Sensitivity is the proportion of diseased samples 
with a positive test result (Proportion of diseased samples correctly diagnosed). 
Specificity represents the area under the normal sample distribution curve to the left of 
the cut off point while sensitivity represent the area under the diseased sample 
distribution curve to the right of the cut off point. 



TABLE 13 



Test Measure 


Meaning 


Calculation 


Specificity 


Proportion of normal samples with 


Sp=TN/N 




negative test result 




Sensitivity 


Proportion of diseased samples 


Se=TP/D 




with positive test result 





While sensitivity and specificity characterize the performance of a particular 
method, another set of statistics is required to interpret the laboratory test result for a 
given specimen. The positive and negative predictive value quantify the meaning of an 
individual test result (Table 14). The positive predictive value is the probability that if 
the test result is positive, the sample is diseased. The negative predictive value is the 
probability that if the test result is negative, the sample is normal. Positive and negative 
predictive value are calculated from Bayes rule as outlined in Albert and Harris. Table 
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14 contains two equivalent formulas for calculation positive and negative predictive 
value. 



TABLE 14 



Measure 

Positive 
Predictive 
Value 


Meaning 

The probability that, if 
the test is positive, the 
sample is diseased 


Calculation 1 

PV + =TP/Pos 


Calculation 2 

PV<=DSe/(DSe+N(l -Sp)) 


Negative 
Predictive 
Value 


The probability that, if 
the test is negative, the 
sample is normal 


PV.=TN/Neg 


PV=NSp/(NSp+D(l-Se)) 



WO 97/48331 



PCT/US97/10729 



-59- 

APPENDIX II: PRINCIPAL COMPONENTS 



Principal Components of Full-Parameter Constituent Algorithm 1 which differentiates 
5 SILs from Normal Squamous Tissues. Results reported for calibration set: 



Legend 


PCI 


PC3 


PC7 


1 


0.630 


-0.576 


0.288 


1 


0.590 


-0.460 


0.294 


1 


0.902 


-0.849 


-0.034 


1 


1.150 


-0.678 


-0.104 


1 


-0.413 


-0.179 


-0.150 


1 


-1.190 


-01171 


-0.225 


1 


0.489 


-0 r 049 


-0.138 


1 


0.200 


-0.163 


-0.130 


1 


-0.889 


-0.457 


-0.104 


1 


1.060 


-0.256 


-0.253 


1 


1.290 


-0.360 


-0.143 


1 


-0.113 


-0.220 


-0.162 


1 


0.610 


-0.108 


-0.031 


1 


-1.460 


-0.554 


-0.129 


1 


0.468 


-0.314 


-0.262 


1 


1.290 


-0.422 


-0.093 


1 


0.174 


-0.690 


-0.156 


1 


0.428 


-0.798 


-0.225 


1 


1.290 


-0.742 


-0.362 


1 


1.410 


-0.530 


-0.154 


1 


0.284 


-0.518 


-0.331 


1 


2.220 


-1.400 


-0.137 


1 


1.160 


-0.191 


-0.116 


1 


0.231 


-0.099 


-0.247 


1 


1.640 


-0.271 


-0.249 


1 


0.538 


-0.179 


-0.112 




-0.864 


0.032 


0.118 




0.130 


-0.273 


-0.135 




0.152 


0.029 


-0.010 




-0.978 


-0.702 


0.095 




0.635 


-0.120 


-0.079 




1.660 


-0.683 


0.002 




0.934 


-0.401 


-0.150 




0.692 


0.015 


0.144 




0.018 


-0.363 


-0.094 




0.401 


-0.085 


0.071 




0.187 


-0.146 


-0.043 




0.132 


-0.142 


-0.175 
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Legend 
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Principal Components of Reduced-Parameter Constituent Algorithm 2 which 
differentiates SILs from Normal Columnar Tissues. Results reported for calibration set: 
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CLMMS 

A method of detecting tissue abnormality in a tissue sample comprising the steps 

(i) providing a tissue sample; 

(ii) illuminating said sample with electromagnetic radiation wavelengths of 
about 337 nm, about 380 nm and about 460 nm to produce three 
fluorescence intensity spectra; 

(iii) detecting a plurality of emission wavelengths from said fluorescence 
intensity spectra; and 

(iv) calculating from said emission wavelengths a probability that said sample 
is abnormal. 

The method of claim 1 , wherein the illumination wavelengths are in the ranges of 
317-357 nm, 360-400 nm and 440-480 nm. 

3. The method of claim 1 , further comprising the step of preprocessing data at the 
emission wavelengths to reduce inter-sample and intra-sample variation. 

20 

4. The method of claim 1 , wherein said establishing step comprises normalizing said 
spectra relative to a maximum intensity within said spectra. 

5. The method of claim 1, wherein said establishing step does not comprise mean- 
25 scaling said spectra. 

6. The method of claim 1, wherein said emission wavelengths are about 410 nm, 
about 460 nm, about 510 nm and about 580 rim for an illumination of about 337 nm; 
about 460 nm, about 510 nm, about 580 nm, about 600 nm and about 640 nm for an 

30 illumination of about 380 nm; and about 510, about 580 nm, about 600 nm, about 620 
nm, about 640 nm and about 660 nm for an illumination of about 460 nm. 



1. 

of: 

5 



10 
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7. The method of claim 1 , wherein said sample is illuminated in vitro. 



8. The method of claim 1, wherein said sample is illuminated in vivo. 

5 9. The method of claim 7, wherein said providing step comprises obtaining said 
tissue sample by biopsy. 

10. The method of claim 9, wherein said providing step further comprises generating 
a monolayer cell touch preparation or a pellet. 

10 

11. The method of claim 10, wherein said providing step further comprises ethanol 
fixation of said tissue sample. 

12. The method of claim 1, * wherein said iHumitiating comprises illuminating said 
15 sample. substantially normal to a surface of said sample, and wherein said detecting step 

comprises, detecting said spectra at an angle of approximately 20° from normal. 

13. The method of claim 1 , wherein said probability distinguishes normal squamous 
and abnormal tissue. 

20 

14. The method of claim 1, wherein said probability distinguishes normal columnar 
epithelium and abnormal tissue. 

15. The method of claim 1, wherein said probability distinguishes inflamed and 
25 abnormal tissue. 



16. The method of claim 1, wherein said probability distinguishes low grade SIL and 
high grade SIL tissue. 



30 



17. The method of claim 1, wherein said probability distinguishes normal and high 
grade SIL tissue. 



WO 97/48331 



PCT/US97/10729 



-95- 

18. A method of developing a model for differentiating between norma) and 
abnormal tissue in a tissue sample, comprising the steps of: 

(i) providing a plurality of tissue samples; 

(ii) illuminating said samples with electromagnetic radiation wavelengths of 
about 337 nm, about 380 nm and about 460 nm to produce three 
fluorescence intensity spectra for each of said samples; 

(iii) detecting a plurality of emission Wavelengths from said fluorescence 
intensity spectra; 

(iv) forming a set of principal components from said spectra, said principal 
components being defined as providing statistically significant differences 
between normal tissue and various forms of abnormal tissue; and 

(v) subjecting said principal components to logistic discrimination to develop 
a relevant model. 

19. The method of claim 18, wherein the illumination wavelengths are in the ranges 
of 3 1 7-357 nm, 360-400 nm and 440-480 nm.,., 

20. A method of detecting tissue abnormality in a tissue sample comprising the steps 
of: 

(i) providing a tissue sample; 

(ii) illuminating said sample with electromagnetic radiation wavelengths of 
about 337 nm, about 380 nm and about 460 nm to produce three 
fluorescence intensity spectra; 

(iii) detecting a plurality of emission wavelengths from said fluorescence 
intensity spectra; and 

(iv) establishing from principal components PCI, PC3 and PC7 of step (iii) a 
probability that said sample is abnormal. 
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21. A method of detecting tissue abnormality in a tissue sample comprising the steps 
of: 

(i) providing a tissue sample; 
5 (ii) illuminating said sample with electromagnetic radiation wavelengths of 

about 337 nm, about 380 nm and about 460 nm to produce three 
fluorescence intensity spectra; 

(iii) detecting a plurality of emission wavelengths from said fluorescence 
intensity spectra; and 

10 (iv) establishing from principal components PCI, PC2, PC4 and PCS of step 

(iii) a probability that said sample is abnormal. 

22. A method of detecting tissue abnormality in a tissue sample comprising the steps 
of: 

15 

(i) providing a tissue sample; 

(ii) illuminating said sample with electromagnetic radiation wavelengths of 
about 337 nm, about 380 nm and about 460 nm to produce three 
fluorescence intensity spectra; 

20 (iii) detecting a plurality of emission wavelengths from said fluorescence 

intensity spectra; and 

(iv) establishing from principal components PCI, PC3 PC6 and PC8 of step 
(iii) a probability that said sample is abnormal. 

25 23. A method of detecting tissue abnormality in a subject in vivo comprising the steps 
of: 



30 



(i) illuminating a tissue sample in vivo with a plurality of electromagnetic 
radiation wavelengths to produce a plurality of fluorescence intensity 
spectra; 
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(ii) detecting a plurality of emission wavelengths from said fluorescence 
intensity spectra; and 

(iii) establishing from said emission wavelengths a probability that said 
sample is abnormal. 

24. The method of claim 23, wherein the plurality of illumination wavelengths 
includes at least three wavelengths having the ranges of 317-357 nm, 360-400 nm and 
440-480 nm. 

25. The method of claim 23, further comprising the step of preprocessing data at the 
emission wavelengths to reduce inter-patient and intra-patient variation. 

26. A method for diagnosing cervical precancer in a patient comprising the steps of: 

(i) illuminating cervical tissue of said patient with a plurality of 
electromagnetic radiation wavelengths to produce a plurality of 
fluorescence intensity spectra; 

(ii) detecting a plurality of emission frequencies from said fluorescence 
intensity spectra; 

(iii) calculating from said emission frequencies a probability that said sample 
is abnormal; and 

(iv) diagnosing said cervical tissue as a function of said calculation. 

27. The method of claim 1, wherein said method further comprises illuminating said 
sample with an electromagnetic radiation wavelength in the near infrared sufficient to 
produce a Raman spectrum and detecting a Raman spectrum therefrom. 

28. The method of claim 27, wherein the near infrared illumination wavelength is 
about 789 nm. 
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29. The method of claim 28, wherein said emission frequencies are shifted about 626, 
818, 978, 1070, 1175, 1246, 1330, 1454 and 1656 cm 1 from said illumination 
wavelength. 

30. The method of claim 27, wherein at least one of said emission frequencies is 
associated with compound selected from the group consisting of collagen, phopholipids 
and glucose- 1 -phosphate. 

31. An apparatus for detecting tissue abnormality, comprising: 

(i) a light source for emitting a plurality of electromagnetic radiation 
wavelengths; 

(ii) a probe connected to said light source, said probe adapted to apply said 
plurality of electromagnetic radiation wavelengths to tissue under test and 
to gather fluorescence emitted from said tissue under test; 

(iii) detection means, connected to said probe, for detecting at least one 
fluorescence spectrum emitted from said tissue under test; 

. (iv) a programmed computer connected to said detection means, for 
processing said at least one fluorescence spectrum according to a 
predetermined algorithm to establish a probability that said tissue under 
test is abnormal. 

32. The apparatus of claim 31 , said light source comprising a nitrogen pumped laser, 

33. The apparatus of claim 32. said plurality of electromagnetic radiation 
wavelengths comprising about 337 nm, about 380 nm and about 460 nm. 



34. The apparatus of claim 31, further comprising a polychromator connected 
intermediate said probe and said detection means. 
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35. The apparatus of claim 31, said probe comprising excitation optical fibers for 
applying said plurality of electromagnetic wavelength to said tissue under test, collection 
optical fibers for gathering said fluorescence emitted from said tissue under test, and a 
shield overlying ends of said excitation optical fibers and said collection optical fibers. 

5 

36. The apparatus of claim 32, said predetermined algorithm including principal 
components that predict statistically relevant differences between fluorescence emission 
wavelengths from normal and abnormal tissues for said plurality of applied 
electromagnetic radiation wavelengths. 

10 

37. The apparatus of claim 36, wherein said abnormal tissues arc selected from the 
group consisting of inflamed, low grade SIL and high grade SIL. 
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